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THE ABILITY TO JUDGE PEOPLE 
RONALD TAFT 


University of Western Australia’ 


What are the factors related to the 
ability to judge accurately such be- 
havioral characteristics as the abili- 
ties, traits, action tendencies, mo- 
tives, and emotions of other people ?? 
Are there some persons who consist- 
ently demonstrate good ability to 
judge others accurately and, if so, 
what are the correlates of such abil- 
ity? These are the general questions 
to which this review is addressed. 
The practical importance of the 
above questions in psychology is ob- 
vious, especially when we consider 
the role of the psychologist’s per- 
sonality in determining the validity 
of the observations and inferences 


with which he works. 


METHODS OF MEASURING ABILITY 
TO JupGE OTHERS 


The number of differing methods 
of measuring ability to judge others 
that have been used in the experi- 
ments in this area may partly ac- 
count for the varied and sometimes 
conflicting results found. The dis- 
tinction between analytic and non- 


' The major part of this review was pre- 
pared by the writer while he was a research 
assistant at the Institute of Personality Assess- 
ment at the University of California. He 
wishes to thank Professors D. W. MacKin- 
non and T.R. Sarbin for their advice and as- 
sistance which have been given unstintingly 

*The review is confined to judgments 
about the emotional, personality, and be- 
havioral characteristics of others, not their 
physical or sociological characteristics. 
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analytic judgments (Wallin, 75) ap- 
pears to be a particularly important 
one. In analytic judgments, the 
judge (J) is required to conceptu- 
alize, and often to quantify, specific 
characteristics of the subject (S) in 
terms of a given frame of reference.’ 
This mainly involves the process of 
inference, typical performances of 
J being rating traits, writing person- 
ality descriptions, and predicting the 
percentage of a group making a given 
response. In nonanalytic judgments, 
J responds in a global fashion, as in 
matching persons with personality 
descriptions and in making predic- 
tions of behavior. An empathetic 
process is usually involved in non- 
analytic judgments. 


A Classification of Tests of Ability 
to Judge Others 


The classification that follows is 
based on that suggested by Notcutt 
and Silva's review of the experimen- 
tal approaches (54). 

1. Perception of emotional expres- 
sions in photographs, drawings, models, 
and movies. This method has been 
used to study ability to judge in a 
number of studies (2 [ch. 8-10], 3, 
13, 18, 25, 26, 30, 33, 37, 38, 39, 73, 
76, 78). The required response may 
be a multiple-choice, a one-word free 


, Fr 


stanas for judge and S for subject. 
‘Throughout this review, the term “subject” 
refers to the person being judged. 
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response, or a completely free re- 
sponse, The criteria are usually S’s 
intention, or the judgment of psy- 
chologists. Less controversial criteria 
were used by Coleman (18), where J 
had to select from a check list the 
situation to which S was responding. 
This type of test usually evokes a 
nonanalytic judgment, although, as 
F. H. Allport (2) has demonstrated, 
these judgments can be made analyti- 
cally. The method has the advantage 
of being neat, but the expressions 
tend to be culturally stereotyped. In 
real-life situations the expression of 
emotions may be idiosyncratic (38, 
42) and thus their recognition may 
require a different type of ability from 
the recognition of stereotyped re- 
sponses. 

2. Rating and ranking of traits. 
This is an analytic method, and has 
the advantage of clear-cut quantifica- 
tion. It also has the virtue of requir- 
ing a performance which is frequently 
used in psychological work. It suffers 
from all the drawbacks of ratings in 
general, particularly the lack of con- 
sensus about the meaning of terms and 
the quantitative standards to be 
used. 

A further difficulty with this 
method of measuring ability to judge 
others is the establishment of criteria. 
Two different approaches to these 
criteria may be distinguished: (a) 
Peer judgments, i.e., pooled judg- 
ments made by the Ss themselves, 
which may or may not include the 
self-ratings (1, 6, 17, 24, 27, 32, 60, 
67, 73, 81). The use of this type of 
criterion suffers from the doubt 
whether we are measuring ability to 
judge or simply the degree to which 
J conforms to the criterion group; the 
nonconformist would score poorly, 
but might in fact be a good judge. (0) 
External criteria—these may be judg- 
ments made by other observers who 


may or may not be well acquainted 
with the Ss; or they may be derived 
from test results (3, 17, 23, 40, 52, 
$3, 67, 71, 73, 78). (Only Cogan 
et al. [17], Estes [23], and Vernon 
[73] used tests as well as ratings to 
provide the criteria.) Taft ‘%7) ob- 
tained an intercorrelation of .72 be- 
tween ratings using each of the two 
types of judgmental criteria, peer 
judgments and external judgments. 

3. Personality descriptions. The J 
is provided with some data about S 
and required to write a description of 
his personality. The data provided 
might be a brief interview with S, 
observation of S in some standard 
situation, or some descriptive ma- 
terial concerning him. This method, 
in general, involves analytic judg- 
ments, but suffers from the vagueness 
of J’s task; also the criterion lacks 
precision, and is usually based on the 
opinions of persons who are arbi- 
trarily regarded as ‘‘expert judges” 
(9, 12, 55). 

4. Personality matchings. In this 
method, J is required to match some 
data concerning S with some other 
data concerning him. Where the Ss 
are known personally to J, the task 
may be to match S with the relevant 
data. The method lends itself readily 
to nonanalytic judgments, but some 
Js may use analytic modes of infer- 
ence to assist them to make the 
matchings. The matching method 
(23, 67, 73, 81) has the great advan- 
tage over the previously described 
methods of studying the ability to 
judge others in that its criterion is 
completely objective, but it has the 
weakness that it constitutes an arti- 
ficial situation not paralleled in every- 
day life or in psychological practice. 

5. Prediction of behavior or life- 
history data. The J has some ac- 
quaintance with S or is given data 
about him, and his task is to predict 
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S’s performance on various test items 
or his responses to personality and 
attitude inventories, or to predict 
specified aspects of his life history. 
These are the so-called “empathy” 
tests (20) and are probably primarily 
nonanalytic. A related test is the 
“‘mass-empathy”’ test (72) in which J 
predicts the combined responses of a 
group of people. It is suggested that 
the ‘“‘mass-empathy” test of predic- 
tion is more likely to be tackled 
analytically than is the empathy 
test, as it does not lend itself so read- 
ily to empathizing with any particu- 
lar person. Thus the empathy test 
will be regarded as nonanalytic, and 
the mass-empathy test as analytic, 
wherever such a distinction is cogent 
to our argument. It is perhaps sig- 
nificant in this reference that a mass- 
empathy test (Kerr) and an empathy 
test (Dymond) were found to be 
uncorrelated for 87 subjects (8). The 
empathy method was used in fifteen 
other studies analyzed (9, 10, 20, 21, 
28, 34, 40 [ch. IIIC], 44, 54, 56, 57, 
59, 73, 78, 79) and the mass-empathy 
method in ten (16, 40, 41, 66, 67, 68, 
69, 70, 72, 74). 

The method of behavioral predic- 
tion, like the matching method, has 
the advantage of possessing an objec- 
tive criterion. The reliability of the 
predictions on any one item tends to 
be low, and, therefore, the test should 
preferably consist of a number of 
items or behavioral events to be pre- 
dicted. One weakness of the empathy 
method, demonstrated by Hastorf 
and Bender (11, 35) is the spurious 
effect of projection; the predictions 
made by J are often partly the result 
of projection of his own personality, 
and, consequently, an accidental re- 
semblance between J and S will ren- 
der the predictions more accurate 
than they otherwise would be. Has- 
torf and Bender suggest the use of an 


index separating the effect of projec- 
tion, similarity, and empathy. A fur- 
ther weakness of the “empathy” 
method is the possibility that judg- 
ments may often be made correctly 
by using cultural stereotype responses 
without attempting to predict the 
responses of the particular S. Gage 
(28) has demonstrated how accuracy 
of stereotype predictions can be kept 
separate from accuracy of individual 
predictions. 

6. A few miscellaneous techniques 
should also be mentioned, although 
studies based on them, in general, 
fall outside the scope of this review. 
Most of these techniques employ 
indices that attempt to measure the 
ability to judge others in an indirect 
manner, e.g., Dymond’s empathy 
index on the TAT (17), Walton's 
generalized empathy test (76), and 
McClelland’s ‘“‘role-playing ability” 
scale (46). Even more remote are the 
tests devised by Chapin (15), Moreno 
and Moreno (49), Sherriffs (61), Moss 
Social Intelligence Test (50 used in 
73). Most of these measures are 
either assumed to possess face valid- 
ity or have been validated against 
one of the first five methods of testing 
ability to judge others. 


Is THERE A GENERAL ABILITY 
TO JuDGE OTHERS? 


Is there sufficient consistency in 
the ability to judge others for persons 
to be characterized as good or poor 
judges? We are here concerned with 
the generality and specificity of the 
ability to judge others and with test- 
retest reliability of the measures 
used. G. W. Allport suggests that the 
ability to judge others is analogous to 
artistic ability in that it is neither en- 
tirely general nor entirely specific. 
“It would be unreasonable, therefore, 
to expect a judge of people to be uni- 
formly successful in estimating every 
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quality of every person. ... It seems 
more of an error, however, to con- 
sider the ability entirely specific than 
to consider it entirely general” (5, p. 
512 and footnote). Let us now look 
at the experimental evidence which, 
on the whole, supports Allport’s con- 
tention. 


Consistency Between Different 
Types of Tests 


Some persons may be better at 
judging others on analytical tests, 
while others may be better on non- 
analytic tests. F. H. Allport (2) 
found that, in judging emotional ex- 
pression (Rudolph poses), some Js 
were superior at judging the intended 
emotion when using a naive type of 
intuitive method, while other Js were 
superior after receiving a training in 
the use of analytic methods of making 
the same judgments. Using 44 meas- 
ures of ability to judge, Vernon (73) 
obtained significant correlations of 
over .30 for-S’s ability to rate stran- 
gers (analytic) and his performance on 
subtests of the Moss Social Situations 
Test involving nonanalytic judg- 
ments about people (Social Situations 
and Observation of Human Be- 
havior). Wedeck (78) obtained sig- 
nificant positive correlations ranging 
from .18 to .56 (mean .31) between 
a test of ability to rate the personality 
traits of verbally described persons 
and seven other tests of ability to 
judge emotions and personal qualities 
of persons depicted in various ways, 
both pictorial and verbal. The Ss 
consisted of 203 adolescent school 
girls. A factor analysis revealed a 
“psychological factor” with a high 
saturation in the judgment of emo- 
tions tests (nonanalytic) but a negli- 
gible one in the trait-rating tests 
(analytic). Wolf and Murray (81) 
using four Js found a “fairly consist- 
ent’’ rank order between the accuracy 


of J's predictions of the 15 Ss’ rank 
order on three objective tests and J's 
ability to match the Ss with their 
Dramatic Production Test records. 
Taft (67) obtained a significant cor- 
relation of .36 between the ability to 
rate others on traits and the ability 
to predict group responses on an in- 
ventory. On the other hand, a test of 
ability to match the Ss with their 
mosaic productions did not correlate 
significantly with these two analytic 
tests. Neither did an empathy test 
with a mass-empathy test (see p. 3 
supra). In general then, significant 
but low consistency has usually been 
found between one test of ability to 
judge others and another, but some 
studies suggest that analytic and non- 
analytic tests tend to differ in their 
results. We shall therefore need to 
note in our review the type of test 
used in the particular experiments 
reported. 


Reliability of Tests 


Adams (1) reports an average test- 
retest reliability of .55 for the ac- 
. >*v of sorority girls in ranking 
nine of their “sisters’’ on 63 person- 
ality traits, the interval between the 
two tests being approximately three 
weeks. The criteria were the pooled 
rankings. This index of reliability 
indicates that there is some consist- 
ency for a particular judge in making 
specific judgments, but that the con- 
sistency is not high. If this is typical, 
the influence of attenuation on cor- 
relation coefficients using such data 
would be considerable. In fact, only 
a few of the studies report even mod- 
erately high reliabilities. Dymond 
(21) reports a split-half reliability for 
her empathy test (predictions of S's 
ratings of himself and of J on six 
characteristics) of .82 and a test- 
retest reliability after six weeks of .60 
(20). Travers (69) asked his Ss to 








THE ABILITY TO JUDGE PEOPLE 5 


predict the percentage of a specified 
population who would answer ‘‘true”’ 
or “‘false’’ to each of 25 items, and the 
split-half reliability of the accuracy 
scores for this test was .64. 

Taft (67) obtained a test-retest re- 
liability coefficient of .82 for a 30- 
item prediction test similar to that of 
Travers but this contrasted with a 
split-half reliability of only .20. Cor- 
relations between single items tend to 
be even lower than .20 (67, 69, 74), a 
further indication of the low reliabil- 
ity of any one item. We thus see the 
necessity for tests of this nature to 
include a large number of items in 
order to ensure reliability. 


Consistency in Accuracy Between 
Traits and Between Subjects 


Travers, in a further experiment 
(70), found correlations of .44 and 
47 respectively for two groups of 
students (N, 26 and 31) between 
ability to judge the word knowledge 
of S’s own group and ability to judge 
that of the general population. ‘‘The 
evidence then indicates that some 
subjects are generally good at judging 
what various groups of men know, 
while others are poor’’ (70, p. 98). 
Taft’s study yielded a corresponding 
correlation of .31 for 40 Ss on tests 
analogous to those of Travers. 

Vernon (73) did not compute the 
reliabilities of the tests which he 
used, but, on the basis of “logically 
related aggregates,”’ i.e., clusters, he 
suggests that his data show four inde- 
pendent dimensions—ability to judge 
self, to judge acquaintances, to rate 
strangers, and to judge character 
sketches of strangers. Kelly and 
Fiske (40, ch. IIIC) attempted to de- 
vise a test of the ability of psycholo- 
gists to predict inventory responses of 
two patients whom the latter had 
diagnosed. The respective accuracies 
achieved in judging each patient in- 


tercorrelated .23 for 100 Js. In con- 
trast to this low consistency, Estes 
(23) reports that his ten best judges 
in a test of ability to rate the traits 
of persons depicted in short movies 
were consistently more accurate than 
the ten poorest judges on all 23 vari- 
ables and for all eight Ss. The cri- 
teria of accuracy were the ratings as- 
signed by clinical psychologists. 

Gage (28) also reports high gener- 
ality in the ability of his Js to judge 
the responses of six strangers on the 
Kuder Preference Inventory after 
they had directly observed S's ex- 
pressive behavior for a short time. 
He found high consistency in J's 
ability to judge irrespective of the 
specific items or Ss on which he was 
being tested. Consistency (correla- 
tion of .71) was also shown between 
ability to predict the responses of the 
six strangers and those of three ran- 
domly selected classmates. Similar 
findings, with regard to consistency 
in J’s accuracy between different Ss 
are reported by Bender and Hastorf 
(11). 

Luft (45) suggests that there might 
be individual differences in consist- 
ency amongst Js. In one of his ex- 
periments using 74 Js, all persons 
with various degrees of training in 
psychology, he found that 27 per 
cent of the Js were able to predict the 
personal inventory responses of both 
of two Ss with above chance ac- 
curacy. The predictions were made 
on the basis of cage-record material. 
However, the accuracy scores of the 
total sample of 74 Js did not correlate 
significantly from one S to the other. 
These findings may be taken in con- 
junction with those of Dymond (20) 
in her experiment referred to above. 
She found that the capable judges 
tended to show less variation in the 
accuracy with which they could judge 
the S's self-ratings on each of six 
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traits than did poorer judges. Thus, 
good judges, at least, seem to show 
some consistency in ability to judge 
irrespective of the type of S or the 
type of qualities being judged. We 
could not expect, however, that even 
a capable judge would be able to 
judge members of another culture as 
well as he can judge members of his 
own; there is evidence that judg- 
ments are more accurate when J and 
S are similar in cultural backgrounds, 
also in age and sex (34, 36, 67, 71, 81). 
Nor could even a good judge be ex- 
pected to judge all areas of the per- 
sonality or use different kinds of 
“clinical evidence’ equally well. 
(Space does not permit the inclusion 
of a systematic review of the experi- 
ments that have been performed on 
these specific factors in judging [67, 
pp. 12—23)). 

Conclusion. Allport’s dictum at the 
beginning of this section seems to be 
justified by the data. A reasonable 
conclusion might be stated as follows: 
the degree to which a person can 
make accurate judgments about 
others is a function of his general 
ability to judge and of specific situ- 
ational and interactional factors, but 
the greater his general ability to 
judge, the less will be the relative in- 
fluence of the specific factors. The 
specific factors concerned are the 
type of S, the relationship between 
J and S, the type of judgment de- 
manded, the traits being judged, and 
the material available to J. In addi- 
tion, the consistency in the perform- 
ance of any individual J is limited by 
the low reliability that characterizes 
many of the tests used to measure 
ability to judge. In view of the effect 
of specific factors we must be cautious 
in drawing general conclusions from 
studies that do not require J to make 
a variety of judgments about a va- 
riety of Ss. 


THe CHARACTERISTICS OF Goop 
JupGeEs or OTHERS 


In consequence of the above con- 
clusions, some of the findings quoted 
in the review that follows should be 
limited to the actual Ss, judging situ- 
ations, traits, or modes used in the 
particular experiments. Wherever 
similar findings are obtained over 
several situations, we may expect 
that the individual characteristics of 
the good judges concerned can be 
generalized, but where contradictory 
findings are obtained between one 
study and another the variation may 
merely be due to the specific factors 
involved in the experiments. Our 
plan has been to report findings as 
conclusions where the weight of the 
evidence clearly supports them, even 
though some contradictory findings 
may raise doubts. 


Age 

Gates (30) found a progressive in- 
crease from the ages of 3 to 14 years 
in the ability to judge the intended 
emotional expressions in six of the 
Ruckmick poses. Walton (76), using 
various tests of empathy with emo- 
tional expression, obtained similar 
results for Js ranging from kinder- 
garten to university age. His pub- 
lished tables seem to suggest that the 
greatest improvements occur be- 
tween 9 and 11, and between 14 and 
adulthood. Further support for the 
development of empathy in children 
is provided by Dymond et al. (22) 
where a marked increase in signs of 
empathy is reported between 7 and 
11 years. The test consisted of pre- 
dictions by the Js of whether their 
classmates liked or disliked them. 

On the other hand a number of 
studies using adults as Js failed to 
find any increase with age in ability 
to judge others (16, 23, 41, 56, 64, 67). 
These studies employed Js ranging 
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from 18 to at least the late thirties, 
but the tests mostly tested their 
ability to predict the responses of 
college age Ss. Consequently, the 
older Js would be less similar to their 
Ss than would the younger Js and 
the former may therefore have been 
under a handicap. 

Conclusions. While ability to judge 
emotional expressions increases with 
age in children—probably through 
experience—no increase with age has 
been found in adults on various tests 
of ability to judge others. This latter 
finding, however, is subject to the 
limitation that the subjects being 
judged have mostly been closer in age 
to the younger judges. 


Sex 


It has been contended that “Ex- 
perimental studies, so far as they go, 
establish only a slight margin in favor 
of women”’ (5, p. 517). This seems to 
be a reasonable summary of the many 
studies that have been conducted on 
sex differences in ability to judge 
people. One study (37) reports a sig- 
nificant superiority for women stu- 
dents in judging the emotional ex- 
pressions in the Rudolph poses, but 
in the other studies of judging emo- 
tional expressions in photographs, 
models, and movies, no significant or 
consistent sex differences are re- 
ported. In five studies (2, 18, 25, 30, 
33) the authors report no differences; 
in three (13, 26, 39) a slight, but ap- 
parently insignificant, superiority for 
women; and in one (38) a slight supe- 
riority for men. The results are not 
related to the sex of the person whose 
emotions were being judged. 

Similar results have been found in 
other tests of judging ability. Dy- 
mond (20) reports no sex differences 
at first in the ability of students to 
predict how other members of their 
group rated themselves and J; how- 


“enaage es 


ever, in a second administration, six 
weeks later, the female Js were sig- 
nificantly better than the males. Ina 
further experiment (21), she found 
female students significantly superior 
to males in making judgments similar 
to those in her first experiment. 
These two studies differed from most 
of the other ones on sex differences in 
that the Js were well acquainted 
with the Ss. However, where hus- 
bands and wives predicted each 
other’s responses to an inventory 
(54), sex differences in accuracy were 
not significant; if anything, the males 
were the more accurate. 

None of the other five reported ex- 
periments (46, 56, 64, 69, 71) which 
compare the ability of males and fe- 
males either to rate their S’s traits or 
to predict their responses found sig- 
nificant differences. It is also inter- 
esting to note that no relationship 
was found between the ability of 
males on these types of tests of judg- 
ing and their scores on inventory and 
projective tests of femininity (67). 

Conclusions. The weight of evi- 
dence is in favor of no sex difference 
in ability to judge, or perhaps a 
slightly superior ability in women. 
The reported studies do not suggest 
any convincing explanations for this 
possible superiority of women, and 
until further evidence suggests a 
changed view, it would be wise to 
conclude that there are no differences. 


Family Background and 
Sibling Rank 


In a study of ability to predict how 
other boys would answer personal in- 
ventory questions (mass-empathy), 
Sweet (66) found a positive relation- 
ship between judging ability and the 
socioeconomic status of the Js. These 
results, however, could have been due 
to the influence of intelligence. The 
only other study reported on this 











8 RONALD TAFT 


topic, Taft (67), found no correlation 
among graduate students between 
ability to judge and socioeconomic 
status, where the test required J to 
rate S on six traits, and to predict 
their personal inventory responses. 
The criteria for the ratings were both 
peer and psychological assessments. 

Regarding size of family and order 
of birth, an earlier suggestion by F. 
H. Allport (2) that Js from larger 
families are better at judging emo- 
tional expression has not been sup- 
ported. (Allport used only 26 Js and 
does not report significance.) Estes 
(23) found no relation between “‘sib- 
ling status’’ and ability to judge on a 
trait-rating test and in Taft's study 
(67) there was actually a negative 
correlation between ability to judge 
and number of siblings. In this latter 
study it was also found that the best 
Js were only children; then came 
eldest, middle, and youngest children 
in that order. The explanation given 
is that the fewer the older siblings, 
the more likely is a child to be 
brought into contact early with adult 
modes of judging others. 

Taft also found that those Js who 
had a rural background were poorer 
at judging than the urban ones but 
this might be explicable again in 
terms of sibling status. Minority 
group status showed an interesting 
pattern in this study: Negroes and 
foreign students were poorer and 
Jewish students better than the aver- 
age in judging (see above for the 
details of the test used). The expla- 
nation given is that the former groups 
are too isolated and dissimilar from 
the others to judge them accurately, 
but the Jewish group possesses a 
marginal position in the American 
culture as a whole which provides 
both the opportunity and the stimu- 
lus for making accurate judgments. 

Conclusions. The findings reported 





on the influences of background fac- 
tors on ability to judge are derived 
from very few studies, and almost en- 
tirely from one. Thus no consisten- 
cies can be educed until we have more 
evidence. ; 


Intelligence and Perception 


G. W. Allport sums up the studies 
up to 1937 on the relationship be- 
tween ability to judge others and 
intelligence as follows: ‘‘Experimen- 
tal studies have found repeatedly 
that some relationship exists between 
superior intelligence and the ability 
to judge others...even within a 
high and narrow range of intelligence. 
... Understanding people is largely 
a matter of perceiving relations be- 
tween past and present activities, 
between expressive behavior and 
inner traits, between cause and effect, 
and intelligence is the ability to per- 
ceive just such relations as these”’ (5, 
p. 514). 

In an early study, Cogan et al. (17) 
found a significant correlation be- 
tween J's intelligence and his ability 
to rate the intelligence of others, and 
also to rate sense of humor. The cri- 
teria were pooled ratings. The corre- 
lations between intelligence and 
ability to rate seven other personality 
traits were not significant. Adam's 
study (1) was similar but more com- 
plex. He studied 80 female students 
living together in groups of 10. They 
ranked themselves and each other on 
63 traits, including seven related to 
intellectual functioning; the pooled 
rankings acted as the criterion both 
for measuring the accuracy of the 
judgments and for measuring the 
characteristics of the Js. The seven 
intellectual characteristics had an 
average correlation of only .12 with 
ability to judge others, the highest 
correlations being .21 for “‘observa- 
tion” and for “mentally bright" 
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(significant at 5 per cent level). We 
should note, however, the possibility 
that the use of peer ratings as the cri- 
terion may have provided a measure 
of J’s conformity to his group, rather 
than of his ability to judge his peers. 

Where some independent measure 
of the criterion was used (21, 55, 67, 
73), the accuracy of analytic judg- 
ments of personality traits showed 
significant positive correlations with 
intelligence fairly generally. Vernon 
obtained significant correlations of 
approximately .30 between intelli- 
gence and various analytic measures 
of ability to judge strangers. For 
other judgments, however, involving 
nonanalytic modes, the correlations 
obtained with intelligence were ap- 
proximately zero. 

Taft (67) scored the intelligence of 
his 40 Js according to their best per- 
formance in any of a number of vari- 
ous cognitive tests. This index of in- 
telligence correlated .37 with their 
ability to judge the traits of their 
peers and to predict their peers’ re- 
sponses to an inventory (analytic). 
Academic ability also showed low 
positive correlations with these tests 
of ability to judge others but the 
sample was highly selected in this re- 
gard (senior graduate students). We- 
deck (78), using 203 girls, reported 
saturations of g factor ranging from 
.18 to .34 for his seven varied tests of 
ability to judge others. There were 
no consistent differences in this re- 
spect between the analytic and the 
nonanalytic tests. 

The results of other studies correlat- 
ing intelligence and ability on nonana- 
lytic modes of judging are contradic- 
tory. In Sweet's study (66), 12- to 
14-year-old boys at a YMCA camp 
predicted how their peers would re- 
spond to a questionnaire on 22 dif- 
ferent activities. There was a “high 
relationship’’ between their accu- 


racy and their CAVI scores. The 
assessees in the OSS procedures (55) 
wrote personality sketches of their 
peers and these were rated by the as- 
sessment staff for accuracy. Despite 
the admitted unreliability of the cri- 
teria, the assessees’ ability to judge 
correlated significantly with various 
verbal intelligence tests (.32 to .48) 
and .54 with the final staff rating on 
Effective Intelligence. The latter 
rating, however, was contaminated 
both with the scoring of the judging 
test and with the intelligence test re- 
sults. 

The only other study found that re- 
ported a significant positive correlation 
between judging ability and verbal 
intelligence (.21) was that of Kanner 
(38) which correlated the ability of 
students to judge the intended emo- 
tions in the Feleky poses and their 
Thorndike intelligence scores. Dy- 
mond (21) reports a positive correla- 
tion between her Js’ ability to predict 
the S’s self and other ratings on six 
traits and Js’ scores on the Wechsler 
Performance Scale. However, a fur- 
ther study repeating the same tech- 
nique (43) found no correlation with 
the ACE Group Test. Similarly Kelly 
and Fiske (40) found a significant cor- 
relation between ability to predict 
inventory responses and score on the 
fluency subtest of the PMA but not 
with any of the other subtests or with 
the Miller Analogies test. 

All the other studies correlating 
nonanalytic modes of judging with 
intelligence found no correlation, for 
example, between (a) the accuracy of 
predictions of the questionnaire re- 
sponses of eight psychology students 
and the Js’ ACE scores (9), (0) ability 
to “throw’’ scores on a masculinity- 


femininity test and Thorndike “‘intel- 
ligence score’ or grade point ratio 
(41), (c) Army Alpha scores and the 
ability of Js to match correctly a 
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short behavior record with personal- 
ity sketches (23), (d) ability to pre- 
dict group responses and the Otis, 
Cc ,perative General Culture Test, 
or Thorndike Reading Test scores of 
the Js (69, 70), (e) various intelli- 
gence tests and the ability of gradu- 
ate students to match mosaic produc- 
tions with acquaintances (67), and 
(f) various group intelligence tests 
and ability to predict inventory re- 
sponses on the basis of a brief ob- 
servation of S’s expressive behavior 
(28). 

Conclusions. There seems to be a 
positive relationship between intelli- 
gence and ability to judge others an- 
alytically. The highest correlation 
reported is .55 but the use of intel- 
lectually homogeneous groups in 
most of the studies would have re- 
duced the correlations obtained. The 
superiority of more intelligent Js is 
probably the most pronounced in 
ability to rate the intelligence of 
others. The results for analytic 
modes do not appear surprising as 
such modes require a precise under- 
standing of the meaning and applica- 
tion of abstract terms (traits). 

Nonanalytic modes of judging tend 
to manifest lower correlations be- 
tween intelligence and accuracy of 
judgment. It is possible that accu- 
rate nonanalytic judgments of others 
are more a function of good percep- 
tual and judgmental attitudes than 
of the use of abstract intelligence, 
provided the mode of making the 
judgments is clearly within the level 
of comprehension of J. 


Training in Psychology 


It is often assumed that qualified 
psychologists are more capable than 
laymen of making unbiased judg- 
ments, since they receive training in 
the dynamics of personality and also 
in the correct manner of making judg- 
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ments, e.g., using fixed standards, 
considering only relevant evidence, 
avoiding projection, combining prob- 
abilities in their correct weight, etc. 
On the other hand, some writers argue 
the opposite. For example, Murray 
(51) claims that the use of analytic 
perception and induction together 
with the repression of emotion and 
feeling leads to poor ability to judge 
others. 

Let us now turn to the experimen- 
tal evidence on the question. Are 
those who have taken courses in psy- 
chology more accurate judges than 
those who have not? In a test of 
judging emotional expressions in the 
Boring-Titchener models on an ad- 
jective check-list, students who had 
completed at least one course in psy- 
chology were, if anything, less accu- 
rate than students just taking their 
first course (13). Hanks (34) found 
no relationship between training in 
psychology and the ability to predict 
S’s answers to inventory questions 
from biographical and other inven- 
tory data. Polansky (56), on the 
other hand, found graduate students 
in psychology to be better judges than 
those without psychological training 
when judging Ss who were friends of 
the author. Whether the Ss, as well 
as the Js, were also trained in psy- 
chology is not stated. The tests re- 
quired Js to make specific behavioral 
predictions about three Ss whose ac- 
tual behavior was known. 

Studies comparing the ability of 
professional psychologists with non- 
psychologists do not, in general, sug- 
gest that those trained in psychology 
are better judges. If anything, the 
contrary seems to be the case. In 
Estes’ experiment (23) psychologists 
were significantly poorer than the 
average of a wide variety of Js (pro- 
fessional and nonprofessional per- 
sons) in judging others on ratings, 
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check lists, and matching tests. The 
material presented consisted of brief 
samples of S’s expressive behavior re- 
corded in a movie. These findings 
are supported by Wedell and Smith 
(79) who compared qualified, experi- 
enced clinicians with untrained, in- 
experienced interviewers on their 
ability to predict the responses of 200 
interviewees to an attitude question- 
naire. The untrained Js made the 
more accurate predictions. 

Luft (44, 45) compared the ability 
of clinicians (psychiatrists, psycholo- 
gists, and social workers), graduate 
psychology students, and physical 
science students on a series of tests in 
which they were required to predict 
the responses of individuals to objec- 
tive and projective test items. The 
physical scientists were superior to 
all the other groups of Js on the tasks 
taken as a whole. On the other hand, 
Taft (67) found psychologists to be 
superior to graduate students in vari- 
ous other disciplines. In a study of 
ability to rate traits and to predict 
the inventory responses of graduate 
students participating in an assess- 
ment program, judgments made by 
the assessment staff (psychologists) 
were more accurate than similar judg- 
ments made by the assessees them- 
selves (both staff and peer ratings be- 
ing used as criteria for the ratings). 
The assessees did not include psy- 
chology graduate students, but the 
assessment psychologists were ‘‘defi- 
nitely’’ superior to social science stu- 
dents and “probably” superior to 
physical science students in judging. 
In this latter respect, the results ap- 
pear to contradict those of Luft, but 
there was some evidence (not re- 
ported by Taft [67]) that the superi- 
ority of the psychologists in Taft's 
study was due primarily to the ac- 
curacy of the experimental psycholo- 
gists rather than the clinical. In 


Luft’s study the psychologists were 
clinicians. 

Let us now consider some studies 
in which Js with different degrees of 
training in psychology are compared. 
Kelly and Fiske (40) did not find ad- 
vanced graduate students of clinical 
psychology any more accurate at pre- 
dicting the personality inventory re- 
sponses of patients whom they had 
diagnosed through normal psycho- 
logical techniques than similar stu- 
dents with one year less training. 
Kelly and Fiske were also able to 
compare the validity of the assess- 
ments of the professional promise of 
clinical psychology trainees made by 
other graduate students in clinical 
psychology with similar assessments 
made by professional psychologists 
and psychiatrists. The judgments 
were made on the basis of test proto- 
cols and interview reports and the 
Ss were actually unknown to the Js. 
The validity of the assessments was 
determined by S’s later performance 
as a clinical psychologist. The au- 
thors conclude that the students 
“utilized the materials as effectively 
as the more mature 1947 staff [psy- 
chologists]"’ (40, p. 175). However, 
the students had certain advantages 
over the psychologists—they had 
greater similarity to the Ss and had 
themselves undergone the experience 
of assessment; also, being in training 
themselves, they had a more appro- 
priate frame of reference for inter- 
preting the criterion (as the authors 
point out). These factors are less 
likely to explain Soskin’s finding (62) 
that experienced clinical psycholo- 
gists were not more accurate than 
graduate students in predicting the 
inventory responses of a 26-year-old 
mother, from her projective test pro- 
tocols. 

Professional psychologists were 
more successful than psychology stu- 
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dents in correctly diagnosing the psy- 
chiatric patients depicted in the 
Szondi Test (57). The superiority, 
however, could well be due to the 
psychologists’ greater familiarity 
with the diagnostic categories used 
in the test. Technical knowledge 
could also account for the clinicians’ 
superiority in Luft’s experiment re- 
ferred to above. Luft found that the 
clinicians were significantly superior 
to psychology students in predicting 
two Ss’ responses to a projective test 
(but not to an objective test) after 
they had been provided with written 
interview data about the Ss. 

Conclusions. ‘The results on the 
comparative ability of nonpsycholo- 
gists, psychology students, and pro- 
fessional psychologists to judge other 
people are partly obscured by the 
effect of similarity in age and aca- 
demic status between J and S, At- 
tempting to allow for this effect in the 
reported results, physical scientists, 
and possibly other nonpsychologists, 
e.g., personnel workers, appear to be 
more capable of judging others ac- 
curately than are either psychology 
students or clinical psychologists. 
There is a suggestion that experimen- 
tal psychologists may be superior as 
judges to clinical psychologists, but 
this conclusion must await further 
evidence. 

There is also evidence that sug- 
gests that courses in psychology do 
not improve ability to judge others 
and there is considerable doubt 
whether professional psychologists 
show better ability to judge than do 
graduate students in psychology.‘ Do 
these findings necessarily lead to the 


‘In spite of this finding there is reason to 
believe that ability to judge others can be 
improved by specific training in judging and 
repeated specific practice, except where the 
person already has good ability to judge others 
(3, 4, 14, 20, 33, 37, 47). 


conclusion that training in psychol- 
ogy blunts a judge’s ability? Actu- 
ally, the results could be wholly at- 
tributed to selective factors operating 
(a) in the accreditation of graduate 
students as psychologists and (6) in 
the selection of psychology as a ca- 
reer. On this latter point, there are 
some interesting but unanswered 
questions. Perhaps those taking up 
psychology, especially clinical psy- 
chology, are too concerned about so- 
cial relations (see below, Altitude To- 
ward Social Relations) to be good 
judges, or perhaps they have had in- 
sufficient experience with a wide 
range of people. Watson (77) has 
pointed out that many professors and 
clinicians tend to live in isolation 
from the general life experiences of 
the people whom they are endeavor- 
ing to understand. 


Esthetic Ability and Sensitivity 


G. W. Allport claims that of all the 
characteristics of a good judge of 
others “‘esthetic’’ ability stands above 
the others, and can even compensate 
for such things as lack of intelligence 
and experience (5, p. 538). 

Allport and Allport (3) reported 
that the only correlation found for 
“susceptibility to social stimuli’’ 
(ability to judge emotional expres- 
sion—Rudolph poses?) was with ar- 
tistic ability. Of the nine best judges 
on the test, seven had literary pro- 
ductions to their credit. Unfortu- 
nately, no comparable figures are 
given for those low on the test. Ver- 
non (73) found that the more accu- 
rate raters of strangers and of ac- 
quaintances (analytic) tended to be 
more artistic according to their 


scores on various musical and art 
judgment tests and were rated high 
on esthetic values by their peers. 
Ability to match character sketches 
with case history material did not 
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correlate with esthetic judgments. 
Bender (9) reports on a test of ability 
to write accurate descriptions of the 
Ss on the basis of test profiles. The 
author acted as a referee for accuracy. 
He states that all the good judges 
were interested in literary or dramat- 
ic activity but again no statement is 
furnished regarding the poor judges 
in this respect. On another test by 
Bender in which J was required to 
rate S’s standing on various opinion 
scales, ability to judge did not cor- 
relate significantly with the Meier- 
Seashore Art judgment test. Estes 
(23), however, found clear evidence 
that good ability to match character 
sketches was related to painting and 
dramatic avocations but not musical. 
Taft (67) concluded that the abil- 
ity to rate traits accurately and to 
predict the Ss’ inventory responses 
(analytic) correlates positively with 
“simple, traditional, artistic sensi- 
tivity."" This was measured by art 
judgment tests similar te those used 
by Vernon and Bender in which high 
scores are allotted for ability to fol- 
low traditional artistic rules. On the 
other hand, the same Js’ scores, on a 
test in which their preferences for 
various patterns were compared with 
the preferences of actual artists,’ cor- 
related negatively with analytic judg- 
ing ability. None of the artistic tests 
correlated with performance on a 
nonanalytic test (matching). There 
was also zero correlation between all 
the tests of ability to judge others 
and an index of dramatic ability, i.e., 
ratings of the Js’ ability to empathize 
with roles in a role-playing test. 
Conclusions. The ability to judge 
others seems to be higher in those per- 
sons who have dramatic and artistic 


5 The distinction between these two types 
of esthetic judgment corresponds roughly 
with the symmetrical and asymmetrical types 
described by Barron (7). 


interests, but the relationship is not 
as clear-cut in the case of dramatic 
and artistic ability. 

The evidence on dramatic ability 
is too limited as yet, but in view of 
role-playing theory, this would seem 
to be an important area for study, 
particularly in its relationship to the 
ability to make accurate nonanalytic 
judgments of others. 

The conclusion on artistic ability 
must turn on the definition of such 
ability; while there may be a positive 
correlation between ability to judge 
others on analytic modes and ability 
to endorse traditionally accepted es- 
thetic rules this may possibly be a 
function of intelligence and interest. 
The above relationship is not found 
where the sophisticated standards of 
professional artists are used and it 
might well be that professional ar- 
tists, if tested, would perform poorly 
on tests of ability to judge others, 
The accuracy of nonanalytic judg- 
ments seems to be even less depend- 
ent on artistic ability—with the pos- 
sible exception of literary ability— 
than are analytic judgments. (This 
is explicit in Taft and implicit in Ver- 
non.) 


Emotional Stability and 
Character Integration 


It may be argued that the well- 
adjusted person is less subject to pro- 
jecting himself into others than a 
poorly adjusted person and therefore 
he is able to judge them better. How- 
ever, it is also possible to argue that a 
poorly adjusted person, who at tne 
same time is aware of his emotional 
difficulties, is more sensitive to simi- 
lar difficulties in others. Most of the 
findings in this area indicate that the 
first argument is the more correct, 
but the evidence is not unequivocal. 

Where the tests of ability to judge 
are analytic in nature, the studies 





14 RONALD TAFT 


have reported a significant positive 
relationship between accuracy and 
emotional adjustment. The only pos- 
sible exception is that of Adams (1) 
who reports that his good raters of 
personality traits tend to be rated 
high by their peers on the following: 
touchy, lack courage, work for pres- 
ent, independent, talkative, egotistic. 
These are signs of poor superficial 
adjustment, but do not necessarily 
point to a fundamental maladjust- 
ment. The other experiments reveal 
a positive relationship between an- 
alytic tests of judging and (a) the 
Bell Adjustment Inventory (69, 70), 
(b) the Character Education Test 
(66), (c) teacher’s ratings that chil- 
dren need psychological help (32), 
(d) steel-workers’ accident-proneness 
(63), (e) the California Authoritarian 
Scale (59), and (f) ratings of graduate 
students by their faculty on ‘“‘per- 
sonal soundness” (67). The latter 
study also reported a positive correla- 
tion between ability to make accurate 
analytic judgments and the “‘psychot- 
ic” scales on the MMPI (Pa, Pt, 
Sc). 

The relationship between various 
measures of adjustment and nonan- 
alytic tests of judging are more equiv- 
ocal. Studies using the MMPI sup- 
port the results quoted above: the 
ability of clinical psychologists to 
predict the inventory responses of 
their patients correlated negatively 
with their scores on the Pt scale (41); 
good judges (undergraduates) on Dy- 
mond’s empathy test (43) tended to 
be low on Pd, Pa, Pt, and Sc, and 
psychology graduate students who 
were rated high by their peers on 
“role-playing ability’’ (nonanalytic?) 
also were low on these latter scales 
and on the D scale (46). 

Emotional adjustment as measured 
by the Bell Scale correlated positively 
with ability to match the Ruckmick 


and Frois-Wittman emotional ex- 
pression pictures (26) but not with 
ability to predict inventory responses 
after observing S’s expressive behav- 
ior (28). A test of matching a short 
film of expressive behavior with char- 
acter sketches showed zero correla- 
tion with the Bernreuter (23) whereas 
matching mosaic productions with 
the acquaintances who produced 
them showed a significant negative 
correlation with ratings on “‘personal 
soundness” (67). 

Conclusion. Ability to judge others 
on analytic modes correlates posi- 
tively with emotional adjustment; 
presumably the more psychologically 
significant aspect of this correlation 
is that poor judges tend to be poorly 
adjusted, and therefore, probably 
more likely to allow personal biases 
to affect their judgments. Studies are 
still needed in this area using Ss who 
are more heterogeneous on emotional 
adjustment than the highly selected 
groups used in most of the experi- 


ments reported. Such studies would 
perhaps throw light on the connec- 
tion between poor adjustment and 
poor ability to judge. The need for 
further studies is even more evident 
in the case of nonanalytic modes of 
judging in which the evidence is more 


contradictory. There is, however, a 
clear trend for the MMPI on both 
analytic and nonanalytic judgments 
—the poorer judges tend to be ele- 
vated on the psychotic scales, in par- 
ticular, ‘“psychasthenia.”’ 


Self-Insight 


Is there any relationship between 
having a good knowledge of oneself 
and being an accurate judge of oth- 
ers? The inference that there is such 
a relationship seems reasonable if we 
accept the theory that we learn to 
know ourselves by our acceptance of 
the attitudes of others toward our- 
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selves, and that we learn to know 
others by observations and inferences 
deriving from our introjection of the 
behavior of others. According to this 
formulation the acquiring of self- 
knowledge and knowledge of others 
are indispensable to each other. Writ- 
ers who have stressed this viewpoint 
include Mead (48), Sullivan (65), and 
Cottrell (19). 

Before looking at the experimental 
evidence on this question, we should 
refer briefly to a difficulty involved in 
handling the concept of insight ex- 
perimentally. For experimental pur- 
poses self-insight is usually defined 
operationally as the ability of J to 
judge himself accurately, using as the 
criteria of accuracy judgments made 
by some other persons. However, it 
is possible, especially in the case of 
persons with complex, recondite per- 
sonalities, that the self-judgments 
may be more veridical than those 
made by others. The use of tests, ob- 


jective and projective, as the criteria 
can somewhat ameliorate this diffi- 


culty (e.g., 73). Other steps that 
might be taken are: (a) to use, as 
criteria, judgments made by “‘ex- 
perts’’ (eg., 53) defined either as per- 
sons who know the person well or as 
professional psychologists—although 
as we have seen there is considerable 
doubt whether these latter Js are, in 
fact, more expert than lay Js; (6) to 
require S to judge himself as he pre- 
dicts he will be judged by the other 
Ss, and then to use these latter judg- 
ments as the criteria (e.g., 20)—this 
method has the disadvantage of con- 
taminating judgments of self with 
judgments of others; and (c) to use 
over-all clinical judgments of the per- 
son's ‘‘self-insight’’ (e.g., 73, 80). 

In many studies it has been found 
that the Js tend to rate themselves 
high on admirable traits and low on 
reprehensible ones (3, 17, 32, 60, 67). 


Consequently, those who are actually 
high on admirable or low on repre- 
hensible traits will tend to be scored 
higher than others on self-insight. 
This artifact operates in all studies of 
self-insight, no matter how measured, 
and could affect the relationship 
found between self-insight and abil- 
ity to judge others. 

Let us now look at the results of ex- 
periments in this area: 

1. Do good judges of self possess 
the same personality characteristics 
as good judges of others? Adams 
tested the ability of girls to rank 
themselves and acquaintances on 63 
traits, using the pooled rankings as 
the criterion (1). He concluded that 
good judges of self and good judges 
of others possess quite different 
personality syndromes—strangely 
enough, the former were oriented to- 
wards society, whereas the latter 
were more egotistic. Compared with 
the good judge of others, the good 
self-judge is more intelligent, pos- 
sesses the more desirable emotional 
attributes, and is much more socially 
minded. Vernon (73) obtained sub- 
stantially the same findings as 
Adams, using ratings by the Js of 
themselves, friends, and strangers. 
He used both pooled judgments and 
tests as criteria, but note the weak- 
ness that these tests were mainly of 
the inventory, i.e., self-rating, type. 
When he grouped the intercorrela- 
tions between his data into “logically 
related aggregates,”  self-ratings 
formed a different cluster from rat- 
ings of others. The good self-raters 
tended to have a good sense of humor 
and good abstract intelligence and 
were sociable, whereas the good 
judges of others were less sociable 
and intelligent, but more artistic than 
good self-raters. 

The Ss (graduate students taking 
part in a “‘living-in’’ assessment pro- 





16 RONALD TAFT 


cedure) in Taft's experiment (67) 
predicted their sociometric standing, 
their relative status in the group on 
inventory item responses, and also 
how their peers would rate them on 
six traits. Good judges of self were 
taken as those assessees who were ac- 
curate in making these predictions. 
The assessees were rated by the staff 
on the Gough check list of 279 de- 
scriptive adjectives, and the charac- 
teristics of good judges of self were 
thus compared with the characteris- 
tics of those assessees who were good 
judges of the traits and the inventory 
responses of their fellow assessees. 
Sixteen of the adjectives distin- 
guished good judges of self from good 
judges of others, at the 1 per cent 
level of confidence. Similarly, ten 
adjectives distinguished the poor 
judges of self from the poor judges of 
others. The qualitative differences 


confirmed Adams’ and Vernon's find- 
ing that good judges of self tend to 
be more sociable than good judges 


of others. 

2. What relationship has been 
found between self-insight and ability 
to judge others? In the most simpli- 
fied instance, where the correlations 
are based on J's ability to judge him- 
self and his peers on single traits, a 
high positive relationship has been 
found consistently. Traits on which 
this relationship has been estab- 
lished include both socially approved 
and socially disapproved characteris- 
tics as follows: beauty (58), leader- 
ship (32), empathy (19), obstinacy 
and disorderliness (60). Evidence 
has also been presented demonstrat- 
ing that Js possessing disapproved 
traits but lacking insight into them 
tend to project these traits into per- 
sons whom they are judging more 
than do other Js who possess the 
same traits but do not lack insight 
(60, 80). Where self-insight has been 


measured over a number of traits, or 


in a “global” fashion, the results 
quoted are conflicting. For example, 
in a further experiment, Vernon used 
peer ratings of his Js’ self-insight as 
the measure; this measure correlated 
significantly with the objective meas- 
ure of ability to rate self accurately 
described above, and correspondingly 
these peer-ratings showed no correla- 
tion with various measures of ability 
to judge (ability to rate traits, to 
make predictions about the ” thavior 
of friends and strangers, and to 
match character sketches with case 
study material). Similarly, Frenkel- 
Brunswik (27) found “no consistent 
relationship’”’ between three psychol- 
ogists’ self-insight and their accuracy 
in rating adolescents on the Murray 
needs. The criteria for measuring ac- 
curacy of both self and other ratings 
were the judgments made by the 
other two psychologists. Taft (67) 
also failed to find a significant correla- 
tion between Js’ scores on the over- 
all index of ability to judge others 
(rating the Ss on six traits and pre- 
dicting their responses to inventory 
items) and the index of ability to 
judge self, described above. On the 
other hand, Norman (53) did obtain 
significant correlations in a very simi- 
lar study, using as Ss 72 graduate 
students in psychology who were tak- 
ing part in an assessment program. 
Their over-all ability to judge them- 
selves on 31 personality traits cor- 
related positively with their ability 
to judge their peers on the same 
traits, both when peer ratings and 
when staff ratings were used as the 
criteria. 

The reason for the clear-cut dif- 
ferences in the results of the above 
experiments is not obvious. One dif- 
ference between Norman's study and 
the others is that, in his experiment, 
ability to judge self and others is 
measured by J's accuracy in rating 
himself and his peers on the same list 
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of traits. In Vernon's and Taft's stud- 
ies the measure of ability to judge 
included more than tests of ability 
to rate self and others on a list of 
traits. Frenkel-Brunswik’'s study dif- 
fers in that the Js were not rating 
their peers. 

Taft's results also suggest one sub- 
tlety not mentioned by Norman; a 
chi-square analysis revealed a signifi- 
cant relationship that would not 
show up in either a linear or a cur- 
vilinear correlation: dividing the Js 
into three groups on the basis of their 
accuracy, it was found that whereas 
good judges of others tended to be 
average on ability to judge self and 
average judges of others tended to be 
good judges of self, poor judges of 
others tended to be either good or 
poor judges of self (rather than aver- 
age). 

Conclusion. Persons who show in- 
sight into their own status with re- 
spect to their peers on individual 
traits tend also to rate their peers ac- 
curately on those traits. However, 
when over-all indices are obtained of 
the subject's self-insight and of abil- 
ity to judge others, using a variety of 
tests of these abilities, the relation- 
ship is not so clear-cut. 

Good judges of self have been 
shown to possess a number of traits 
that differentiate them from good 
judges of others: in particular, great- 
er social orientation. A more mature 
consideration is required in future 
work on this topic of the extent to 
which motivational forces that cause 
J to be accurate in his judgments of 
himself also cause him to distort his 
judgments of others. 


Social Relations 
Social Skill and Popularity 


Common sense would suggest that 
a person who possesses good ability to 
judge people is able to use this ability 


to advantage in situations requiring 
social skill, e.g., in leadership or sales- 
manship. It may be argued that abil- 
ity to empathize with others and to 
play their roles, particularly in their 
relationships to oneself, is positively 
related to social skill. While there is 
some evidence to support this view- 
point, there is sufficient evidence to 
the contrary to force us to seek a 
more sophisticated attitude. In a 
study related to the area of role play- 
ing and social judgment, Moreno and 
Moreno (49) found that those chil- 
dren who were able to perceive social 
roles more accurately than others 
(according to the psychologist’s judg- 
ment) were not necessarily those who 
could enact them best. 

Vernon (73) found that the ability 
to judge strangers is related positive- 
ly and significantly to scores on the 
Social Situations and the Memory for 
Names and Faces subtests of the 
George Washington Test of Social 
Intelligence, and negatively to scores 
on the Observation of Human Be- 
havior subtest. In the OSS study 
(55), however, scores on the Judg- 
ment of Others Test (ability to write 
accurate personality descriptions of 
one’s peers) had such a low correla- 
tion with the staff ratings of the 
candidates on Social Relations that 
it was discarded as a measure of this 
trait. Taft (67) used several tests 
and ratings to measure the social 
skills of his subjects; in a role-playing 
test the good judges of others were 
rated significantly lower on ‘‘ingenu- 
ity’’ than were the poor judges; there 
were no differences between good and 
poor judges in the ratings made by 
the assessment staff on ‘‘persuasive- 
ness"’ in discussion nor on “‘likeabil- 
ity’’ as defined by peer sociometric 
choices. Both analytic and nonana- 
lytic judgments showed the same pat- 
tern. 

Norman, in the Clinical Psycholo- 
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gist Assessment Study described 
above, also failed to obtain a linear 
correlation between ability to judge 
and a sociometric measure of the ac- 
ceptability of his Js as professional 
co-workers. There was, however, a 
curvilinear relationship; those who 
received a medium score on accepta- 
bility were significantly better at 
judging than those who received a 
low score on acceptability. There is 
a hint here that a study which in- 
cluded more socially maladjusted per- 
sons than is usually found among Js 
in these types of investigation would 
reveal the expected positive correla- 
tion between social skill and ability 
to judge others. 

Studies of the ability to predict 
group responses have found almost 
without exception a positive relation- 
ship between ability to judge and 
popularity; also good leaders and 
salesmen tend to be good judges. The 
study by Chowdhry and Newcomb 
(16) finds that those chosen as leaders 


in a sociometric test are good judges 
of group opinion only on topics rele- 


vant to the group. Ability on the 
“mass empathy” test (Psychometric 
Affiliates) correlates positively with 
various measures of the popularity 
and efficiency of union officials (72), 
with the ability of new car salesmen 
to sell (68), with the sociometric 
scores of steel workers as working 
companions (63), and with socio- 
metric ratings of students on leader- 
ship after a half-hour discussion (8). 
In this latter study the Dymond test 
also correlated significantly with 
these ratings on leadership. Gage 
(29) reviews several studies (by Gage 
and Suci, Wood, and Sprunger) that 
confirm the positive relationship be- 
tween the degree to which a group 
accepts J and his ability to predict 
the responses of this group to various 
opinion items. The one exception 
quoted is a Naval Leadership Project 


study which failed to find a correla- 
tion between the popularity of leaders 
on a ship and their ability to predict 
their crew's opinions on an attitude 
scale. Gage also found a positive cor- 
relation with popularity among high 
school students when the item re- 
sponses were predicted by the Js for 
each individual S separately instead 
of for the group as a whole (28). 
Conclusions. The ability to predict 
how Ss will respond to opinion items 
shows a consistent positive relation- 
ship with measures of social skill, 
such as leadership, salesmanship, and 
popularity. This relationship would 
follow logically from the probability 
that these types of social skill are 
aided by the ability to predict how 
people will behave. Unfortunately, 
this relationship could also be due 
simply to a combination of J’s tend- 
ency to project his own responses 
onto his S and his being in fact simi- 
lar to S in his responses. Until these 
factors have been more carefully iso- 
lated, it is impossible to know which 
explanation is the correct one. Re- 
cently a method has been proposed 
(11) for separating the effects of pro- 
jection, similarity, and empathy in 
such tests of judging ability. Other 
tests of ability to judge others, e.g., 
rating traits or matching expressive 
behavior, do not show this same con- 
sistent relationship with social skills. 


Attitude Towards Social Relations 


In addition to considering the abil- 
ity to judge others as a social “‘capac- 
ity” it should also be considered as an 
“attitude.”’ An interest in social re- 
lations does not, however, mean good 
ability to judge the personality of 
others. 

Adams working with the judgment 
of traits (see above) found that the 
good judges were independent but 
“talkative.”” “The good judge tends 
towards the egotistic...is cold- 
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blooded towards others and not in- 
terested in them ...he develops a 
shrewd ability to measure others, not 
as human beings, but as tools” (1, p. 
181). This is consistent with the find- 
ings already mentioned that good 
judges of others tend to be less socia- 
ble than good judges of self (1, 67, 
73). 

Dymond (21) reports characteris- 
tics for her “high empathy” group 
that were more comparable to 
Adams’ good self-raters than his good 
raters of others. On the basis of a 
TAT analysis, they are described as 
“outgoing, optimistic, warm, emo- 
tional people, who have a strong in- 
terest in others,”” while those low in 
empathy are “either self-centered 
and demanding in their emotional 
contacts or else lone wolves who pre- 
fer :o get along without strong ties to 
other people” (21, p. 349). Possibly 
this finding results from Dymond's 
method of measuring “‘empathy”’ 
which includes in the score the ability 
to predict how the Ss will rate J as 
well as how they will rate themselves. 
This former measure is close to what 
we have called ability to judge self. 

Taft asked his Js to check their 
own traits on the Gough Adjective 
Check List; the poor judges selected 
socially relevant traits almost exclu- 
sively, e.g., egotistical, noisy, whereas 
the good judges put far more stress 
on traits concerned with executing 
tasks, e.g., imdustrious, patient. In 
the tests of judging, the poor judges 
were significantly more likely to make 
their errors in the direction of gener- 
osity to their Ss than were the good 
judges. These and other data lead to 
the conclusion that the poor judges 
were more socially oriented than the 
good judges, while the latter were 
more task oriented. The social de- 
pendence of the poor judges makes 
them unwilling—or, perhaps, unable 


—to judge their Ss in a “hard- 
headed,” extraceptive manner. 
Conclusion. The evidence supports 
the contention that social detach- 
ment is a necessary prerequisite for 
making accurate judgments of others. 
This social detachment of the good 
judge of others could well account 
for the superior ability in this re- 
spect of physical scientists, referred 
to previously (45, 67), since this 
group might be thought to be less 
concerned with social matters than 
psychologists or social scientists. 


Judgability 


Are good judges those who are 
themselves more difficult to judge? 
On two grounds it might be expected 
that this would be the case: first, ‘As 
a rule people cannot comprehend 
others who are more complex and 
subtle than they” (5, p. 515), and, 
second, good judges are less socially 
expressive than poor judges, as we 
have just seen. Actually, Bender 
states that the better judges in his 
study were, in his opinion, persons 
whom others would have found more 
difficult to judge (9). His test re- 
quired the Js to predict, on the basis 
of character sketches of eight Ss, 
their scores on seven opinion scales. 
Taft also found that his good judges 
were less accurately judged by their 
peers than were the poor judges. 

Dymond found the contrary in her 
study, but since her good judges were 
more sociable than the poor ones, this 
suggests that the judgability of good 
and poor judges is possibly a function 
of their respective sociability. The 
validity of this explanation is borne 
out by the fact that, in an unreported 
part of Taft’s experiment, the socia- 
bility of the Ss was positively cor- 
related with the accuracy with which 
they were judged by both the assess- 
ment staff and their peers. 
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OVER-ALL SUMMARY 
AND CONCLUSIONS 

The ability to judge others has 
been considered as a _ personality 
trait, and its correlates have been dis- 
cussed. Five different methods of 
measuring this ability have been de- 
scribed, and it was suggested that the 
results of the studies quoted may 
vary according to the operational 
definition used. This would seem to 
apply particularly to the distinction 
between analytic and nonanalytic 
techniques, although, when we review 
the findings on the correlates of the 
ability to judge, we find few that re- 
veal a definite difference between 
these two types of techniques. 

The contradictions found between 
studies may be due partly to the low 
reliability of the measures used, and 
partly to the effect of specific factors 
such as the type of judgment re- 
quired, the traits being judged, and 
the Ss used. This problem of speci- 
ficity arises with all traits, but it 
seems to be particularly marked in 
the case of the ability to judge others; 
nevertheless, there does seem to be 
sufficient generality on this ability 
to justify describing at least some 
judges as “good” or “poor.” 

A great deal of carefully designed 
investigation is obviously required in 
the area of judging ability, both with 
respect to this ability as a general 
trait, and with respect to differences 
in the ability under specific condi- 
tions. Future experiments should 
employ longer, more reliable tests of 
the ability to judge the characteris- 
tics of others, and should systemati- 
cally vary the types of tests used, the 
types of Ss being judged, and the 
range of Js. Studies are also required 
in which the effect of motivational 
factors can be observed. 

Our review of the literature sug- 


gests that the following characteris- 
tics are fairly consistently found to 
be positively correlated with the 
ability to judge the personality char- 
acteristics of others: (a) age (chil- 
dren), (6) high intelligence and aca- 
demic ability (with analytic judg- 
ments especially), (¢) specialization 
in the physica. sciences, (d) esthetic 
and dramatic interests, (e) insight in- 
to one’s status with respect to one’s 
peers on specific traits, (f) good emo- 
tional adjustment and integration 
(analytic tests only), and (g) social 
skill (only with tests of ability to pre- 
dict S’s behavior). The ability to 
judge correlates negatively with J's 
social dependence and his ‘‘psychas- 
thenic”’ score on the MMPI. Charac- 
teristics showing fairly consistent 
lack of correlation are age (in adults), 
sex, and training in psychology. 
Some possible relationships on which 
more evidence is still required before 
we can substantiate the possible cor- 
relation with ability to judge men- 
tioned in parentheses are number of 
older siblings (negative), literary abil- 
ity (positive with analytic judg- 
ments), and being a clinical psychol- 
ogist (negative). Two characteristics 
that may hold for especially poor 
judges only are poor social adjust- 
ment, and either good or poor (not 
average) ability to judge self, 

The main attributes of the ability 
to judge others seem to lie in three 
areas: possessing appropriate judg- 
mental norms, judging ability, and 
motivation. Where J is similar in 
background to S he has the advan- 
tage of being readily able to use ap- 
propriate norms for making his judg- 
ment. The relevant judging ability 
seems to be a combination of general 
intelligence and social intelligence, 
with the possibility of an additional 
specific factor for nonanalytic judg- 
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ments (“‘intuition’’)—so far only We- 
deck has distinguished such a factor. 
But probably the most important 
area of all is that of motivation: if the 
judge is motivated to make accurate 
judgments about his subject and if he 
feels himself free to be objective, then 
he has a good chance of achieving his 
aim, provided of course that he has 


the requisite ability and can use the 
appropriate judgmental norms. The 
act of judging one’s fellows is a pur- 
posive piece of behavior that involves 
not only conscious motivation but 
also ingrained attitudes towards so- 
cial relationsnips, including the rela- 
tionships inherent in the act of judg- 
ing itself. 
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The “pure” scientist has tradi- 
tionally prided himself on his concern 
for fact, his indifference to value. 
This attitude was incorporated into 
psychology by Titchener (187), who, 
in his zeal to make psychology scien- 
tific, imitated the classical physicist 
and excluded value along with mean- 
ing and utility from the new science. 
Other “schools of psychology,’’ con- 
ceiving the major problems of their 
science to center around Gestalten, 
purpose, personality, or adjustment, 
rather than around sensation, had, in 
the language of Kéhler, at least some 
place for value in their world of facts 
(97, 98). 
somewhat antiphilosophic inclina- 
tions, considered behaviorism a foun- 
dation for all future experimental 
ethics. Still other psychologists—the 
geisteswissenschaftliche—reserved for 
values a central place in their system 
and rejected the scientific method. 

From these rather heterogeneous 
antecedents there emerged in psy- 
chology during the 1930's applica- 
tions of the scientific method to vari- 
ous aspects of the value problem. 
These and succeeding psychological 
studies of values seem to have been 
directed primarily toward one or 
more of the following problems: 


1. Measuring the values of groups 
of individuals and relating the results 
to other data concerning the groups 
(individual differences). 

2. The origin and development of 
values within the individual. 


' The writer is grateful to Drs. Charles 
Morris and D. W. MacKinnon for critically 
reading major portions of the original manu- 
script and offering many helpful comments. 
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Even Watson (196), with 


3. The influence of an individual's 
values on his cognitive life. 


This review will, consequently, be 
centered around these three topics. 

First, however, some preliminary 
remarks concerning the psychological 
climate which has produced these in- 
vestigations, together with brief men- 
tion of the context from which they 
are abstracted, are deemed appropri- 
ate. 

The strength of the Titchenerian 
attitude that scientists should main- 
tain a hands-off policy toward values 
has been reflected in the dearth of 
penetrating studies of values in the 
psychological literature and in a gen- 
eral lack of psychological textbooks 
in which value occurs as a major con- 
cept—even in areas like personality 
and social psychology. Several psy- 
chologists, it is true, have included 
value as a principal construct in their 
accounts of behavior (e.g., 2, 11, 132, 
134, 174, 180), but their treatments 
are generally sketchy, for as Asch re- 
marks of ethical judgments, ‘“‘we 
hardly possess today a description of 
them, not to mention a theoretical ex- 
planation” (11, p. 354). 

Probably the most ambitious at- 
tempt to make values an integral 
part of psychology is Clawson’'s de- 
scription of human action in the light 
of ten value situations (41). Even 
though his express purpose is to ex- 
amine psychology in action, most of 
his evidence, however interesting, 
seems overly casual—movie plots, 
magazine contests, newspaper head- 
lines, etc. 

The extent to which psychologists 
can contribute to the study of value 
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is still a subject for debate. Hull be- 
lieved that an adequate theory of be- 
havior must necessarily contain a 
theory of value, but such a theory, he 
insisted, would apply to prediction, 
not to characterization of behavior as 
good or bad (86, 87). Snygg (170) is 
of the opinion that psychology should 
furnish the information for specifying 
what is good for people. Cattell con- 
siders the social psychologist “‘better 
fitted than any other specialist to 
inherit from the philosopher the task 
of developing a science of ethics’’ (37, 
p. 32). Hartmann (78) sees no reason 
why psychologists cannot say of 
goals, as well as of means, that one is 
better than another. Extreme rela- 
tivism in values is questioned by 
Murphy, Murphy, and Newcomb 
(134) who remind that, to the indi- 
vidual at a given time, a particular 
value is absolute. 

Money-Kryle (127) contends that 
in a psychoanalytic framework the 


basic question of ethics, that of ends, 


becomes generally answerable. One 
difficulty with his formulation is that 
what he regards as self-evident and 
objective may be considered by 
others as arbitrary and subjective. 
His locating the origin of all values in 
a sense of guilt is, furthermore, likely 
to be completely unacceptable to 
many value theorists. Mullahy (131) 
maintains that a study of interper- 
sonal relations provides both descrip- 
tive and normative information re- 
garding values and that any scientific 
theory of value must be based on 
interpersonal-relations theory. 

Some of the main issues involved 
appear to be what are usually re- 
garded as “‘philosophical’’—the prob- 
lem of defining ends and means, the 
problem of transition from ‘‘what is’’ 
to “what ought to be’’—and obvi- 
ously there is no easy solution. But 
whatever courses psychologists are 


taking, more and more of them seem 
to concur with Fletcher that psy- 
chology “‘is one science which can not 
consistently waive its responsibility 
for dealing with the problem of hu- 
man values” (62, pp. 259-260). 

Naturally not all material perti- 
nent to the psychology of values is 
included in this review. Recent inter- 
disciplinary and cooperative inquir- 
ies (19, 105, 124, 136, 206) and large- 
scale research projects in other social 
sciences (95, 112) provide stimulating 
background for the psychologist 
studying values. They are, however, 
considered beyond the scope of this 
paper, its purpose being to examine 
objective studies at the psychological 
level. 

Research on the rather specialized 
aspects of values, morals, and beauty, 
is also excluded here, except when it 
sheds light on the discussion of spe- 
cific problems, since either of these 
subjects furnishes in itself material 
for lengthy reviews (39, 90, 128, 138). 

Most psychotherapy could be re- 
garded as involving a study of value 
(13, 23, 70, 154, 210), and some scien- 
tific treatments of mental hygiene, 
like McDougall’s, have as a nucleus 
a “rational scale of values” (116, p. 
305). The sheer existence of psycho- 
therapy, moreover, implies the appli- 
cation of a value system——mental dis- 
sease being defined as undesirable and 
mental health as ‘‘good."" But since 
little systematic scrutiny of values 
during therapy seems to have been 
attempted, studies on the theory and 
process of therapy will not be combed 
for possible inclusion here. 

Finally, it should be noted that 
such terms as attitude, interest, mo- 
tive, need, sentiment, or valence are 
often used interchangeably with 
value, or at least to refer to some as- 
pect of value, and investigations of 
level of aspiration, character, or the 
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superego almost necessarily involve 
evaluations. Including every vari- 
able partly synonymous with value 
would be an encyclopedic task as 
broad as general psychology in scope. 
The traditional “aggregate” ap- 
proach (“‘values”’ equal ‘“‘motivation” 
plus “‘learning’’) has, moreover, been 
seriously questioned (11, 104, 121). 
This review will, consequently, re- 
main oriented around investigations 
which are empirically and systemati- 
cally concerned with the more insti- 
tutionalized values. 


INDIVIDUAL DIFFERENCES IN VALUES 


After brief consideration of the 
problem of measurement of values, 
results of studies concerned with 


value differences between some of the 
traditional groupings (e.g., sex, oc- 
cupation) as well as those examining 
the relationship between value meas- 
urement and those from other tests 
will be presented. 

Problems of measurement. The so- 


cial scientist who attempts to meas- 
ure value is confronted with more 
than routine problems of psychome- 
try (46, 64, 67, 115, 150, 151). The 
highly conditional character of value 
judgments, the apparent lack of 
transitivity in some evaluations, and 
the question of common denominator 
for the possible approaches to values 
have not made the application of 
mathematics to this realm easy. 

True enough, Bentham (20), well 
over a century ago, outlined the 
rudiments of a “hedonic calculus”’ 
but not until Thurstone (185) demon- 
strated the utility of psychophysics 
in studying value, and Vernon and 
Allport (195) centered empirical in- 
vestigations of personality around the 
concept of value, did psychologists 
begin studying value with any degree 
of precision. 

During the past two decades a 


number of instruments have been 
provided for measuring or describing 
the values of individuals and groups 
(4, 43, 64, 65, 74, 89, 113, 183, 184, 
186, 193, 200, 203, 207). (See Duffy 
[48] for a review of the major tests 
and test results in this area up to 
1941.) In this variety, little use has 
been made of other than “straight- 
answer” techniques; other devices 
obviously deserve trial. One might, 
for example, capitalize on distortion 
tendencies somewhat as Hammond 
did in his study of attitudes through 
error-choice (73). Resistance to dis- 
traction or satiation might be ex- 
plored as means of quantifying value. 
Finally, in an area so complex as 
values, one might profitably design 
situational tests on the order of those 
used by the OSS Assessment Staff 
(211) in personality evaluation. 

Furthermore, even though a num- 
ber of standardized tests of value are 
available, the large majority of in- 
vestigators employ the Allport-Ver- 
non Study of Values. This section is, 
consequently, predominantly con- 
cerned with applications of the All- 
port-Vernon to problems of individ- 
ual differences. 

Sex differences. The original find- 
ings that men score higher than 
women on the Allport-Vernon theo- 
retical, economic, and political values 
but lower on the aesthetic, religious, 
and social (195) have been amply 
verified (33, 77, 141, 172, 189). 

Some of the hypotheses advanced 
to explain these differences might, 
however, not be so acceptable. Hart- 
mann, for example, avers that “‘one 
very important item affecting the 
nature of ... value organization is 
the biological cleavage represented 
by the fact of sex”’ (77, p. 112), but 
evidence from anthropologists, nota- 
bly Mead (123), on the importance 
of social and cultural factors in pro- 
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ducing behavioral differences be- 
tween the sexes, tends to discredit 
any biological-determinism theory of 
sex differences in values. 

Eagleson and Bell (52) have sug- 
gested what might be termed an ‘‘out- 
group-compensation” hypothesis to 
account for the predominance of re- 
ligious values in groups which are 
denied equal participation in social 
and occupational areas (women, Ne- 
groes, etc.). The applicability of this 
hypothesis here in the case of women 
is open to question; certainly it fails 
to operate in another outgroup: Jews 
score reliably lower on the religious 
value than either Protestants or Ro- 
man Catholics (75, 172). 

What might be regarded as a pre- 
cursor to the sex differences reported 
for adults are Symonds’ findings con- 
cerning adolescents (177). Boys, he 
found, consider money their principal 
problem, girls, personal attractive- 
ness. Boys were found to be more in- 


terested than girls in money, safety, 
health, civic affairs, study, and recre- 
ation; the condition was reversed for 


personal attractiveness, etiquette, 
and getting along with people. 

Sex differences in evaluation, other 
than specific content differences, have 
also been observed. Girls have been 
shown to be more discriminative than 
boys in evaluative ratings of socially 
approved and disapproved behavior 
(179, 205). Since children are capable 
of making finer discriminations in 
stimulus series deemed of value to 
them (50) and since these experi- 
ments involved social events, which 
are supposedly more important to 
girls than to boys, one may wonder 
how ‘content-free’ these differences 
are. The generality of this keener 
discrimination needs to be tested in 
a value area like the economic or 
political in which sex differences are 
reversed. 


Similarities, not differences, be- 
tween the sexes are emphasized in 
studies of moral attitudes (44, 165). 
The findings of these investigators 
are, however, in no way contradic- 
tory to those obtained from the All- 
port-Vernon for they differ both in 
subject matter and in level of analy- 
sis. 

An elemental approach, Hartmann 
(77) maintains, de-emphasizes psy- 
chological differences between the 
sexes, whereas a more comprehensive 
framework accentuates them, be- 
cause even though elements are iden- 
tical, the organization may be differ- 
ent. One might further suspect that 
psychologically important differences 
would be accentuated and _ trivial 
ones de-emphasized by a more global 
approach since differences discovered 
with a relatively atomistic approach 
might cancel each other, or otherwise 
not appear in molar behavior. How- 
ever, the possibility of glossing over 
important differences by too compre- 
hensive a framework should not be 
dismissed. 

Body and personality type.  Al- 
though Sheldon’s typology (162) in- 
cludes some statements concerning 
the preference behavior of temper- 
aments characterizing ecto-, endo-, 
and mesomorphs, possible relation- 
ships between body type and values 
have not often been examined. Stud- 
ies employing the Allport-Vernon 
suggest that endomorphs are rela- 
tively high on the social, ectomorphs 
on the theoretical, and mesomorphs 
on the political value (42, 168). Data 
provided by Morris (129) indicate 
similar possible relationships between 
preferred paths of life and body 
types. 

Jaensch’s types were found to dif- 
fer in their approach to values, I,/l, 
being intuitive, I, fanatical, I; practi- 
cal, and S, guided by established 
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norms (197). Though the study was 
an intensive one, the subjective ele- 
ment in response categorization and 
the lack of statistical rigor in the 
treatment of the data make repetition 
under more effective controls de- 
sirable before his generalizations can 
be properly evaluated. 

The authoritarian personality, too, 
is pictured as a “type’’ that may be 
differentiated on the basis of value 
structure, among other variables, 
from the nonprejudiced person. He is 
described as generally oriented to- 
ward conventional values, as rather 
rigidly conforming to externalized 
values—an orientation and _ con- 
formity which permeate most of his 
behavior: family, social, political, 
religious, «tc. (1). In view of the 


apparent neglect of the statistically 
independent dimensions of variability 
—e.g., the common variance of anti- 
Semitism and anti-Negroism is esti- 
mated to be about 20 per cent of the 


respective total variance (83)—and 
because of the use of the same sample 
both to establish item discrimination 
and to validate the “‘scales,’’ such 
generalizations concerning the au- 
thoritarian “type are open to ques- 
tion. 

Major academic interest. Probably 
because of their availability, different 
academic groups have often been 
analyzed with respect to their values. 
Results are too numerous to allow de- 
tailed presentation; only major trends 
will be cited. 

Commercial and business students 
score above the Allport-Vernon norm 
for the economic value, below the 
norm on the aesthetic value. Litera- 
ture and drama students are above 
average on the aesthetic, and science 
students above average on the theo- 
retical value. 

Medical students score higher than 
law, business, or engineering students 


on the theoretical, and lower than law 
students on the political. Liberal arts 
students are more aesthetic and more 
theoretical than business or engineer- 
ing students; both business and law 
students are more political than those 
in liberal arts. Law students are less 
economic than business students but 
more economic than those in liberal 
arts, who in turn are more economic 
than medical students (15, 33, 66, 75, 
158, 189, 201, 204). 

When the best predictors for each 
of thirteen criteria of performance in 
clinical psychology were selected by 
Kelly and Fiske (92) from a large 
number of objective-test scores, no 
Allport-Vernon variable was in- 
cluded, though the correlation be- 
tween research competence and the 
religious value was _ substantially 
negative. Furthermore, Veterans 
Administration clinical students were 
found to score higher on the social 
value than the nonclinical graduate 
students in psychology. 

Though many statistically signifi- 
cant differences are reported between 
the values of various academic groups 
and though these differences are 
practically always in the expected 
direction, the order of the differences 
for any single value is so low and the 
possible variation in other values, as 
well as in “alien factors,"’ so great 
that prediction of major academic 
pursuit from value scores would not 
be very efficient. 

Possible “‘blind spots” in sampling, 
furthermore, make generalizations 
from studies of individual differences 
extremely probabilistic in character, 
and generalizations can hardly be 
overly cautious when the sample is 
composed of ‘‘college sophomores.” 

Inquiries regarding academic inter- 
ests generally lead to questions con- 
cerning achievement, aptitude, and 
general mental ability. The possible 





PSYCHOLOGICAL STUDIES OF VALUES 29 


connection between these and values 
will be considered next. 

Intelligence, aptitude, and achieve- 
ment. One persistent problem in the 
history of philosophy has been the 
relationship between truth and good- 
ness. Thorndike gave twentieth- 
century voice to the Socratic maxim 
“To know the good is to do the good,” 
when he asserted that ‘Goodness and 
intelligence are positively correlated” 
(182, p. 4). From a rather exhaustive 
collection of moral and intellectual 
data, ratings, standardized tests, 
school grades, etc., Chaswell (40), 
one of his students, provided evidence 
for his assertion. His statement and 
her data seem reasonable enough in- 
sofar as both labels, “good” and 
“intelligent,’’ entail adjustment to or 
respect for a given culture. A wholly 
satisfactory empirical test for such a 
relationship would seem a little diffi- 
cult, especially in view of Morris’ 
findings (130) that even intraculture 


consensus on the good path of life is 
far from perfect. Some psychologists 


have furnished data on a similar 
problem, namely the relationship be- 
tween dominant value and success in 
intellectual pursuits. 

In some instances these relation- 
ships have been found to be negligible 
(49, 189, 193). Pintner (141), how- 
ever, reports positive relationships 
between the social value and both in- 
telligence and class marks, and nega- 
tive relationships between intelli- 
gence and both the economic and 
political values. In seeking to ac- 
count for these contrasting results, 
one could suspect the difference in 
weight of social items on the tests or 
the difference in samples, Pintner’s 
being a special section of an educa- 
tional psychology class that may have 
been drawn from a population in 
which brightness, liberality, and so- 
cial concern are interrelated. Sup- 


port for the observed relationship 
between the social value and achieve- 
ment in college may be found in the 
significantly higher social values (All- 
port-Vernon) of those who success- 
fully complete college, as compared 
to those of withdrawals (9, 160). 
However, in each instance samples 
were drawn, as in Pintner’s study, 
from a population of education stu- 
dents. The negative correlation be- 
tween the political value and intelli- 
gence is corroborated by Schaefer 
(158). 

It is not apparent in view of the 
way in which most tests of general in- 
telligence are constructed why one 
should expect any relationship be- 
tween intelligence and any single 
value or all values collectively, and 
similarly between the composite 
“class marks’’ and values. A person 
could regard his performance in 
school, or the supposedly related 
marks, as a means to the realization 
of most any value. The investigator 
would, therefore, need to know the 
meaning of the marks to the individ- 
ual student. 

A seemingly more meaningful 
search is that for relationships be- 
tween particular values or value pat- 
terns and academic achievement or 
aptitude in specific areas. When the 
latter is measured by standardized 
tests, significant correlations between 
it and some values emerge. The ACE 
General Science subtest, for example, 
correlates positively with the theo- 
retical value (158), clerical accuracy 
on the Thurstone Employment Test, 
negatively with the social value (189). 

Rothney (155) found some low but 
statistically significant relationships 
between marks in specific courses and 
value scores in a group of high school 
boys, but the forecasting efficiency of 
any single value was less than two per 
cent. 
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Bills (22) explored the question of 
congruence of values between student 
and instructor as related to scholastic 
achievement. Students who held the 
same general value orientation as 
their instructor were found toe have 
higher final class marks (asdeter- 
mined by objective examinations) 
than those of equal mental ability but 
differing in value orientation from the 
instructor. The factors behind this 
relationship seem to merit further ex- 
ploration—does the instructor's sys- 
tem of values lead him to select for 
class discussion and for examinations 
facts and issues which are of more 
interest to the students who are like 
him in value orientation? Are these 
students unwittingly more often re- 
warded by the instructor, finding 
thereby increased incentive to work? 
In the study of the relationship be- 
tween values and achievement or 


ability, Bills’ work represents a major 
step forward, for it demonstrates that 


the value system of the achiever 
should not be considered apart from 
that of the person judging the 
achievement. 

Vocational interests. Much overlap 
is to be expected between vocational 
and academic interests, since in many 
instances academic curricula serve 
also as vocational training. Here as 
in the case of academic interests, only 
major trends can be reported. 

Persons in religious occupations or 
training programs score unusually 
high (generally mean >45 on original 
Allport-Vernon) on _ the religious 
value (33, 114, 139, 148). 

At least seven studies have sought 
to relate results from value studies to 
those from vocational-interest inven- 
tories (10, 31, 49, 58, 157, 193, 204). 
Differences in sampling and in tests 
used make brief comparisons of re- 
sults impossible. Most consistently 
reported relationships link positively 


interest in being an office worker to 
the economic and social values, an 
artist or architect to aesthetic, a 
physician, scientist, or engineer to 
theoretical. Negative relationships 
include office worker-theoretical; ar- 
tist-, architect-, scientist-, or physi- 
cian-economic ; and engineer-political. 

In view of some of the relationships 
isolated, value tests have been 
deemed worthwhile supplements in 
vocational guidance or vocational 
selection to those of the more apti- 
tude-ability variety (6, 7, 120, 161). 

On the basis of Bills’ finding re- 
garding academic success (22), one 
might, furthermore, suspect that vo- 
cational success is significantly re- 
lated to congruence of values of 
employees and employers-managers- 
owners. Certainly, as Fiske (61) 
indicates, the criteria for hiring, 
firing, and promoting are usually 
based on the value judgments of the 
latter groups. This principle is, of 
course, employed in part by some 
vocational-guidance inventories. 

Friendship and marriage. In the 
area of social and family relationships 
significant value relationships be- 
tween pairs of mutual friends and 
between husband-wife pairs have 
been repeatedly demonstrated (24, 
126, 146, 152, 153, 159, 181). To 
conclude, however, that husbands 
and wives (or friends) tend to grow 
more alike in values is unwarranted 
on the basis of the evidence pre- 
sented from cross-sectional studies 
since selective factors (e.g., divorce) 
might increase similarity for the 
group without a single couple becom- 
ing more alike. 

Fisher’s survey (60) of couples 
married long enough to have children 
in college suggests that similarity in 
religious values overshadows the 
others. The implications of this 
finding are certainly not clear; follow- 
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up studies are necessary before an- 
sweriig these and other questions: 
Is the behavior sampled by the All- 
port-Vernon religious value, the com- 
prehension of and adjustment to the 
cosmos as a whole, of such a nature 
that congeniality here far outweighs 
consensus in other areas in producing 
lasting marriages? Did this condition 
of rather high agreement exist at 
marriage or did it evolve? Was simi- 
larity of religious values possibly 
considered most important at the 
time of these persons’ marriages so 
that this was a general condition of 
selection ? 

Some helpful insights might be ob- 
tained by giving more consideration 
to individual husband-wife or mu- 
tual-friend pairs and studying how 
couples or friends with certain degrees 
of relatedness in value patterns differ 
from those of other degrees. Success- 
ful marriages could be compared to 
divorces with respect to value sys- 
tems. 

The safest answer to the question 
raised regarding the significance of 
values in close personal relationships 
is perhaps that in some areas simi- 
larity of values may be vital to 
congeniality, while in others, differ- 
ences may be desirable, while in still 
others, the value may be irrelevant 
to compatibility. A survey by Himes 
(82), for example, indicates that con- 
currence in values centering around 
personality traits and home and fam- 
ily life is important in mate selec- 
tion, those around family and general 
social status not. The problem is 
then one of defining the conditions 
under which the value is important 
to the friendship or marriage relation- 
ship and specifying something regard- 
ing optimal degree of similarity or 
difference. 

Happiness and adjustment. A com- 
plex extension of the preceding prob- 


lem is the attempt to objectify 
happiness and adjustment and relate 
them to values. No significant corre- 
lation was isolated between happiness 
scores, as measured by self- and 
associate-ratings, and any of the 
Allport-Vernon values (76). 

Pintner and Forlano (142, 143) sus- 
pected a relationship between emo- 
tional stability and either conflicting 
values or values held with extreme 
intensity, but various comparisons of 
Thurstone Personality Schedule and 
Allport-Vernon scores resulted in 
negative conclusions. 

When, however, ratings of college 
men were obtained {trom academic 
records, deans, and advisers com- 
posite ratings of adjustment (health, 
home, religious, social, etc.) showed 
positive correlations with religious 
and social values, negative with eco- 
nomic (188). A similar sampling of 
the female college population should 
provide interesting complementary 
data. 

In the clinical situation Rogers 
(154) has made the observation that 
as a client becomes better adjusted, 
certain of his attitudes toward values 
change. He progresses from a confi- 
dence in a rigid introjected system of 
values through a period of confusion 
to the realization that he may do 
much to shape his own values. 

The popular view of the aesthete as 
neurotic is not supported when the 
latter is defined in terms of Bern- 
reuter scores and the former by the 
Allport-Vernon (199), though the 
test was far from definitive and the 
population rather specialized (Negro 
music students). For extraversion- 
introversion, however, significantly 
higher aesthetic scores are reported 
for the introverted third of a group of 
college men than for the extraverted 
third (164). 


As a possible additional approach 
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to the value-adjustment problem one 
might study conjointly the adjust- 
ment of a person, his own system of 
values, and the value of the psycho- 
logical group to which he belongs on 
the theory that individuals who find 
their principal associates drastically 
different from them in value orienta- 
tion are likely to be maladjusted. In 
another approach one might have 
persons take a value test under two 
conditions, once describing his actual 
values, once indicating those he feels 
he -ought to hold. Discrepancies 
could then be analyzed in relation to 
adjustment. 

Religion. In an effort to study the 
effect of religious background on 
value orientation, Woodruff admin- 
istered his Study of Choices to 12 
groups of persons who differed in re- 
ligious affiliation and in socioeco- 
nomic and geographic background. 
Since the groups were chosen because 
they differed widely in religious back- 
ground, his principal result that re- 
ligion showed a wider range of vari- 
ation than any other value is hardly 
unexpected. 

Even if one overlooks the fact that 
ordinal scores are treated as interval 
scores and even if one agrees with 
Woodruff's assertion that “religious 
experience has an important effect on 
the value patterns of young people’ 
(208, p. 147), his study can hardly be 
said to furnish unequivocal demon- 
stration of this effect. Many other 
factors (socioeconomic status, region, 
college) vary along with religion, the 
conditions under which an effect may 
be said to be important are not pre- 
cisely defined, and religion as an inde- 
pendent variable (religious experi- 
ence) is not clearly differentiated from 
religion as a dependent variable (test 
response). 

A more convincing demonstration 
of the effect of varying religious back- 


grounds is provided by Kalhorn’s 
comparison of Mennonite and non- 
Mennonite children (91). Besides ex- 
pected differences in things and ac- 
tivities valued, her study reveals that 
in the former group more events are 
perceived in relation to religious val- 
ues, and sources of authority are more 
likely to be religious. 

On the Allport-Vernon, Southern 
Negro ministers were found to score 
highest on religious, lowest on aes- 
thetic (148). But since similar results 
are obtained from members of their 
congregations, from nonchurch mem- 
bers, and from Southern white and 
Negro women college students (52, 
69), factors other than religious back- 
ground, especially limited cultural 
advantages and discriminatory prac- 
tices, are apparently more important 
in producing this pattern. 

Regional, national, and other cul- 
tural differences. The more generally 
rural character of the South as com- 
pared with other regions raises the 
question of possible rural-urban dif- 
ferences in values. However, Landis’ 
hypothesis that rural girls would be 
more economic and less aesthetic than 
urban girls was not verified by his 
data (100). Sometimes a positive 
relationship between the urban-rural 
(or town size) dimension and radical- 
ism-conservatism is reported (38), 
though in other instances the rela- 
tionship is found to be negligible (71), 
while in still other cases, liberalism- 
conservatism differences among com- 
munities of .‘ifferent sizes have been 
discovered in certain value areas but 
not in others (93). Dissenting results 
seem at least partly due to differences 
in degree of generality in the use of 
the terms “‘liberal’’ and “‘conserva- 
tive’ and in the degree of mechaniza- 
tion of the rural sample. 

Class differences in values and 
valuings have not been overlooked. 
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Centers (38) cites evidence showing 
that whereas the characteristic value 
of the middle class is self-expression, 
that of the working class is security. 

Some information for racial or cul- 
tural comparison of evaluative atti- 
tude is provided by Morris (130). 
When he compared the preferences of 
Asiatics and Americans for various 
paths of life, he found no support for 
the stereotypes of the patient, re- 
signed, receptive Oriental and the ac- 
tive, aggressive American. Each 
national group preferred in about 
equal amounts paths characterized by 
dependence and warmth, by striving 
and dominance, and by detachment 
and passivity. His samples were, 
however, of university populations, 
and their typicality might be ques- 
tioned; college students might be 
more alike in their “ideal ways to 
live’’ than men-on-the-street would 
be. The fact that the time of his 


sampling was post-World War II, 


an era of astounding change, may 
further account for the degree of 
similarity in the different cultures. 
According to Lo’s survey of Chi- 
nese college students (110), simi- 
larities between Chinese and Ameri- 
cans in moral judgment are also more 
obvious than differences. 
McGranahan (119) finds in the 
expression of social attitudes evi- 
dence of differences in general pat- 
terns of values between post-war 
German and American youth. As 
might be expected, the latter value 
individual action, freedom of expres- 
sion, etc. whereas the former value 
power, subjugation to the state, etc. 
Results of cross-cultural surveys 
should, of course, be interpreted with 
caution for behavior readily regarded 
as manifestations of certain values 
may not be so interpreted when addi- 
tional aspects of cultural organiza- 
tion are taken into account. In in- 


stances where data have been col- 
lected by means of group tests or 
check lists of behavioral adjectives, 
additional devices requiring the sub- 
ject to “evaluate” or further specify 
the meaning of his check marks have 
proved helpful in making intergroup 
comparisons (101). 

Specific attitudinal differences. Psy- 
chologists, even with the aid of an- 
thropologists, have not formulated 
the relationship between values and 
attitudes in a wholly satisfactory 
manner. Linton (109) has suggested 
a stimulus-response distinction, val- 
ues being stimuli; Kluckhohn (96) 
differentiates them on the basis of 
attitudes being individual and lack- 
ing imputation of the desirable. Some 
psychologists (e.g., Cantril, 32) seem 
to regard values as more general and 
basic than attitudes. 

Woodruff and DiVesta (209) fur- 
nish some experimental data to indi- 
cate that a person's attitudes toward 
an event are a function of nis concepts 
of how his strong values will be af- 
fected by that. event, but the most 
adequate demonstration of this would 
involve controls not employed by 
Woodruff and DiVesta, namely, 
matched groups, some of which would 
be exposed to positive propaganda, 
others to negative. 

Without being overconcerned for 
the definitional problems involved, 
certain investigators have discovered 
some relationships between values 
and social attitudes as measured by 
standard tests (55, 135, 141, 156, 
167). The observed relationships 
between values and attitudes are, 
however, far from univocal. Certain 
strong values may be irrelevant to 
some attitudes (169), and not all dif- 
ferences in attitudes can be inter- 
preted as differences in values (166). 

The interrelationships of values, 
attitudes, and concepts are, however 
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analyzed, not at all simple. Besides 
a rather acute semantic problem, the 
investigator is faced, as discussed in 
a later section, with the fact that 
cognitive structures are themselves 
partly determined by value systems. 

Values and expressive behavior. 
Differences in dominant value are 
observable in expressive behavior, 
both in writing and in speaking. 
Graphologists were able to name, 
with an expectancy far better than 
“chance,"’ dominant values of a se- 
lected sample of handwritings (34); 
value profiles were drawn which 
showed “conspicuous agreement” 
with those obtained from the Allport- 
Vernon (35). 

Laymen as well as psychologists 
match with better than the usual 
chance expectancy the dominant 
value of speakers who are hidden 
from view (3, 56). Since, in the 
latter, more comprehensive study 
part of the speakers were women, sex 
differences may unknowingly have 


contaminated the results. A repeti- 


tion with all male or female “‘types 
would seem desirable. 

Though none of the identifications 
approach perfection, the frequency 
of identifications of better-than- 
chance expectancy is rather remark- 
able in view of the somewhat limited 
sample of behavior in each instance. 
Such accuracy, assuming it trans- 
cends the sex-difference factor, seems 
to attest the rather permeating char- 
acter of one’s value system. An inter- 
esting variation would explore indi- 
vidual differences in the degree to 
which values are expressed in writing 
or speaking. 

Self-ratings. After considering val- 
ues in relation to well-known group- 
ings and traditional performance 
variables, one may wonder about 
value systems as perceived by the 
valuer. 


Vernon and Allport (195) consid- 
ered self-ratings unsatisfactory as 
validating criteria for their test be- 
cause of the differing connotations of 
the value names and descriptions. 
Others have found self-ratings ac- 
ceptable indicators of values, though 
large individual differences in insight 
were noted (7, 57, 141, 175). Caution, 
however, must be exercised in inter- 
preting such insight since ‘‘poor in- 
sight” here may be due either to lack 
of self-knowledge or to inadequate 
understanding of the instrument by 
which insight is judged. 

The therapeutic situation should 
provide another fertile field for in- 
vestigating insights into one’s own 
values; useful information concerning 
the relationship between therapy and 
value-insight might be obtained by 
giving a value test and asking for 
rankings before therapy and repeat- 
ing the procedure at the termination 
of therapy. 

Summary statement. For two dec- 
ades now the Allport-Vernon Study 
of Values, only occasionally replaced 
by some other value test, has been a 
popular instrument in differential 
psychology; it has received wide- 
spread, sometimes uncritical usage. 
Often investigators have reported, 
without over-all evaluation, signifi- 
cant relationship between one value 
and some other variable. Sometimes 
repetition has confirmed the findings; 
but in other cases such isolated find- 
ings should be considered more sug- 
gestive than established. 

In view of the nature of the basic 
responses required by the Allport- 
Vernon (rankings), the purist can 
argue: that the usual scoring proce- 
dures and the subsequent treatment 
and interpretation of the test results 
violate the nonadditive character of 
ordinal measurement; that strictly 
speaking, all its statistics should be 
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nonparametric (i.e., in terms of num- 
ber of persons in whom a given value 
ranks high or low). Pragmatically, 
however, customary practices appear 
to be justified. 

Improved understanding of the 
relationship between value systems 
and other variables seems to hinge 
on two research policies: one, capi- 
talizing on the interdisciplinary ap- 
proach; the other, putting the indi- 
vidual back in individual differenc.s, 
making the person, not the particular 
variable, the unit of interest, utilizing 
devices like Stephenson’s Q technique 
more often, and studying more in- 
tensely the characteristics both of 
those individuals in whom expected 
trends are manifest and those who 
violate these trends. 


THE DEVELOPMENT OF VALUES 


Children are born into a society 
where norms and values are estab- 
lished, but the process of interiorizing 
them is one about which a child psy- 
chologist recently remarked, ‘‘We are 
not exactly certain how children ac- 
quire the fairly stable value-systems 
of adult life’ (180, p. 555). Hutt and 
Miller (88, 125) preface their theo- 
retical considerations of significant 
factors in the process of value in- 
teriorization with the comment that 
“many aspects of the relationship be- 
tween value interiorization and per- 
sonality formation during the first 
few years of life remain unexplored 
by experimentalist and_ clinician 
alike’’ (125, p. 2). 

Apparently direct personal experi- 
ences with intent to learn do not seem 
to be essential to the process, and the 
reinforcement employed in the in- 
doctrination is not always remem- 
bered later (85, 163, 178). Controlled 
studies have usually dealt with some 
rather limited aspect of the problem, 
and several general schemata have 


been utilized in assimilating the find- 
ings—classical and instrumental con- 
ditioning, ego-superego structuring, 
subject-object (self—not-self) separa- 
tion, and person differentiation. 
While admittedly some schools of 
psychology have contributed more to 
certain problems than others have, 
their differing psychological lan- 
guages may be essentially tautologi- 
cal, and the basis for choice of frame- 
work largely emotional; for this rea- 
son this section will be little con- 
cerned with theoretical bias, being 
primarily restricted to results. 

Early emergence of values and valu- 
ings. Psychologists do not always 
agree as to when value systems or 
social values are first observable in 
children. In reviewing Ehre’s conclu- 
sions that children from two to seven 
are dominated by ‘“‘economic” values 
and those of sense pleasures, Murphy 
and Murphy (133) questioned 
whether attributing a “system of 
values’’ to a young child is not imply- 
ing more conceptualization than is to 
be expected of him. Werner indicates 
that a young child has a flexible scale 
of values, and presents anecdotal 
evidence to illustrate how this scale 
“naturally changes not only to meet 
the demands of the current situation, 
but also with the age level’’ (198, p. 
384). 

Possible indications of fairly well- 
developed social values and attitudes 
in young children may be found in 
the five-to-one ratios of own-group 
acceptance to own-group rejections in 
Catholic, Jewish, and Protestant 
children, aged five to eight (149) and 
in the demonstrated respect for the 
property of others in six-year-old 
children (53). 

The reason given why one should 
respect the property of others re- 
portedly shifts from fear of punish- 
ment in the lower age groups to con- 
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cern for who might be hurt and the 
degree of possible harm in the high- 
school groups. This suggests that even 
though the observable respect may 
be relatively the same for first-graders 
as for adults, it is mediated through 
radically different channels. Now, if 
one asks ‘‘Is the value the same?”’ the 
answer obviously depends upon 
whether one includes the ‘“‘mediating 
mechanism” or judges solely on im- 
mediate effects. If, furthermore, one 
selects the first alternative, he is faced 
with the additional problem of speci- 
fying which mediating mechanisms 
permit one to label choice behavior 
as evaluative. Some investigators 
apparently demand a more compli- 
cated covert response than others. 

Some age trends. Turner (190) fur- 
nishes evidence that social values, as 
represented by altruism,appear rather 
early, but within the range of his 
sample (9- to 16-year-old boys), age 
was not found significantly related to 
altruism. Developmental studies of 
ethical insight (84), selfishness (191), 
and interiorization of moral norms 
(18), however, would suggest that 
Turner's instrument was not sensitive 
enough to reflect differences in al- 
truism which may be occurring dur- 
ing those years. 

The change in aesthetic evaluation 
with age is simply demonstrated by 
Thompson (178). Preferences for 
rectangles shift from those of width- 
length ratios of .70~.75 for sixth-grade 
children to those approximating ‘the 
golden mean for adults. Thompson 
advances a cultural-familiarity hy- 
pothesis to account for these adult 
preferences. Testing such a hypothe- 
sis, though possible only through ex- 
tensive ecological surveys for indi- 
vidual subjects somewhat in the man- 
ner recommended by Brunswik (30), 
might well be worth the effort ex- 
pended. 


The fuller appreciation of social 
values as expressed by increasing dis- 
crimination in teacher-approved and 
teacher-disapproved acts (205) and 
in peer-acceptable and peer-unac- 
ceptable behavior (179) has also been 
carefully plotted. These do not, how- 
ever, necessarily attest increasing 
correct appraisal of the situations, 
since this could be measured only by 
first determining how much the 
teachers (or peers) actually did re- 
ward and punish each behavioral act. 

Other worthwhile projects might 
be directed toward accumulating sim- 
ilar evidence on other institutional- 
ized values (e.g., economic, religion, 
etc.). 

Many revealing data and stimu- 
lating hypotheses concerning the 
genesis and growth of psychological 
processes are furnished by Piaget 
(140). Using his well-known methods 
of observing, questioning, and con- 
versing with children, he extends his 
familiar theories regarding the ego- 
centricity of the child into the moral 
sphere. The young child, he main- 
tains, is a moral realist, a product 
partly of his realism in general and 
partly of adult restraint. Acts, says 
Piaget, are judged predominantly in 
terms of objective responsibility, that 
is, by material consequences instead 
of by intentions (cf. Lee's description 
of value in primitive societies [103]). 
Cooperation and autonomy later re- 
place egocentrism and_ restraint. 
Piaget’s picture of the developing 
concept of morality and “rules of the 
game” in maturing children is in 
many respects similar to Rogers’ 
account of the changing concept of 
values during therapy (154). 

The rather rigid emphasis on stages 
and leaning toward hereditary fac- 
tors, for which Piaget has been 
criticized, are not so prominent here, 
the experiential factors in the devel- 
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opment of obligation being given 
more weight. His seeming disdain 
for statistics has, however, not dis- 
appeared. 

Family, social, and economic factors 
im norm interiorization. The most 
conspicuous factor in the develop- 
ment of an individual’s social values 
appears to be his home and family 
life. Democratic atmosphere of the 
home, interparental relationships, 
and parental attitude toward peer 
(child’s) activity were shown to be 
specially significant in character de- 
velopment (25). Children were found 
to resemble their parents in moral 
knowledge, but were not significantly 
like their club leaders, school teach- 
ers, or Sunday School teachers (79). 

The appropriateness of the tech- 
niques used in exploring these latter 
relationships—between children and 
adult leaders—may be questioned. 
Certainly one may wonder whether, 
if one or two groups of children with 
equivalent moral knowledge were ex- 
posed to a teacher with one kind of 
moral bias while the other group was 
under a teacher with a somewhat 
opposing bias, their moral knowledge 
would not, after a short while, be dif- 
ferent. The possibility remains, how- 
ever, that the insignificant correla- 
tions are a function either of the less 
impressionable age of the child when 
under others’ influence and his rather 
limited association with them, or of a 
highly restricted range of moral 
knowledge in adult leaders. Highly 
selective factors operate in determin- 
ing who will be teachers and leaders; 
whereas almost anybody can be a 
parent or a friend. 

The child's changing concept of his 
ideal self shows, according to Havig- 
hurst, Robinson, and Dorr (80), the 
decreasing importance of family fig- 
ures with increasing age. They main- 
tain that his identification around six 


to eight is with parents, that it moves 
outward from the family circle be- 
coming stabilized in late adolescence 
around a nonexistent composite per- 
son. Since none of their children were 
under ten, their investigation only 
partly confirms their hypothesis. 
When the proportion of younger 
children (10-12 years) who name the 
parent as the person whom they wish 
to be like is compared with that of 
older children (16-17 years), no sig- 
nificant difference is found; the num- 
ber describing a ‘‘composite ideal” is, 
however, different for the two groups. 

On the basis of his study of the 
developing ideologies of school chil- 
dren, Bavelas (17) claims that the 
activities listed by children as praise- 
worthy shift from mere conformance 
and nondisturbance to excelling, and 
that the sources of approval-disap- 
proval show changes from peers to 
school principal. Since only percent- 
age figures are cited, the reliability of 
these shifts is not known. 

Probably of more significance than 
his particular results is his emphasis 
on the conflicting values in the life 
space of a child: what is of positive 
value at the club may be of negative 
value at home. Most social scientists 
recognize and comment at length on 
this conflict of values, but few exten- 
sive research programs in psychology 
have attacked the problem by follow- 
ing children in all their various sub- 
groups, observing how they learn the 
values of each subgroup and under 
what circumstances conflict in values 
arises, and how “higher-order” values 
may serve as mediators. 

The influence of social class on 
acculturation has been skillfully ana- 
lyzed by Davis (45) who emphasizes 
the difficulty of value interiorization 
for members of outgroups because of 
systems of rewards and punishments 
different from that of ingroups. 
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Possible influence of socioeconomic 
status in the value-interiorization 
process is further revealed in a com- 
parison of the answers of two groups 
of children, one of relatively high, the 
other of low socioeconomic back- 
ground, to a problem of discipline 
(47). The high group gave relatively 
more constructive responses, settling 
the problem on the basis of reason, 
and relatively fewer nonconstructive 
solutions in which appeal was made to 
authority or punishment mentioned. 
Unfortunately for a definitive con- 
clusion, socioeconomic status was 
confounded with schooling difference, 
“high’s” being in a progressive school, 
“low’s"’ in a traditional. 

Some of the group influences on 
norm formation and modification 


have been studied experimentally. 
Sherif (163) demonstrated that in an 
unstable situation a group will estab- 
lish norms peculiar to itself; individ- 
uals later joining the group will tend 
to adopt the established norm, which 


in turn may be modified in the direc- 
tion of the joiner’s standards, and 
persons leaving the group usually 
retain the group norms. (Sherif’s 
work on the formation of norms under 
laboratory conditions is likely to 
remind one of Helson’s experiments 
on level of adaptation [81]. Although 
this concept has thus far been largely 
restricted in application to percep- 
tual data, it should prove extremely 
useful in investigations of value de- 
velopment.) 

The extent to which an individual 
identifies with a group would un- 
doubtedly determine in part how 
much his own value structure would 
be influenced, and conversely, his 
degree of acceptance would partly 
determine how much the group norm 
would be modified in his direction. 
Lewin and Grable even regard this as 
a cardinal principle of value orienta- 


tion: ‘The individual accepts the new 
system of values and beliefs by ac- 
cepting belongingness to a group” 
(108, p. 62). Such a principle is also 
clearly demonstrated in Newcomb’s 
survey of attitudes in a college com- 
munity (135). 

Lewin (107) has successfully ap- 
plied this and other of his notions 
concerning value modification to the 
problem of changing food habits dur- 
ing World War II. The values under- 
lying food choices were analyzed, and 
the desired changes linked to estab- 
lished values (e.g., health, status, 
etc.) which were then increased in 
potency by being connected to other 
values not ordinarily ascociated with 
food choice (e.g., patriotism). The 
most effective mechanism of change, 
he reported, was group decision. 

Value changes in college. The role 
that colleges play in shaping the 
values of students has been variously 
represented. Sometimes the college is 
seen as a “cultural rubber-stamp,” 
further entrenching the status quo. 
College students have been described 
as highly conservative in value orien- 
tation (68), and the constancy of 
values during college years has been 
noted by several! investigators (31, 
59, 188, 202). 

In other instances the ‘“molding”’ 
character of the college is observed. 
Differential changes in the relative 
strength of values (9) and in the 
growth of moral judgment (14) have 
been noted in major prograins of 
study. Newcomb (135) reports 
changes in students’ attitudes and 
values which were progressive from 
freshman to senior year and com- 
munity-wide (i.e., not restricted to 
any single major program) and which 
reflected increasing dissatisfaction 
with the status quo. 

The exact role of colleges in value 
development remains undefined. Cer- 
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tainly wide differences both among 
and within institutions exist: while 
some universities may simply reflect 
the cultural pattern, others may 
strive to modify it in the light of 
increased knowledge Then, too, 
there is undoubtedly a wide range in 
student flexibility-rigidity in value 
orientation, i.e., their susceptibility 
to change in valuings. Partitioning 
the variance attributable to these 
different sources and their interac- 
tions would be a very complex task. 

Personality organization and value 
modification. In addition to these 
more external features, some relation- 
ships between manner of coping with 
values and other developing psycho- 
logical functions may be observed. 
Spranger’s psychology (173), the 
ultimate source of much of the con- 
temporary work on value, is, of 
course, based on the idea that an 
individual’s fundamental values and 
valuings are reflected in all his be- 
havior. Lecky (102) considered the 
individual’s valuation of himself the 
organizer of all his values. 

In her account of the ethnocentric 
child, Frenkel-Brunswik describes his 
behavior as characterized by “rigid 
dichotomizing, aggressiveness, fear of 
imaginary dangers, of threats, and of 
deprivation, and exaggerated adher- 
ence to conventional values such as 
cleanliness and order’’ (63, p. 556). 
Such a coordinated picture of psycho- 
logical modes in this rather spe- 
cialized type of child as well as those 
from Weiland (197) on Jaensch’s 
typology suggests that in studying 
value interiorization more attention 
might profitably be devoted to this 
“form” of behavior in relation to 
other “forms” of behaving. 

Values in older groups. 


Psycholo- 
gists are showing increasing interest 
in the behavior of persons somewhat 


beyond college age. Though their 


contributions to gerontology have 
been largely in “‘intellectual’’ areas 
(creativity, vocabulary, etc.), some 
investigations have been concerned 
with the values and goals of the aged 
and aging. 

Kuhlen and Johnson (99) report 
certain changes in goals with age, e.g., 
increased importance of retirement, 
decreased desire to change jobs. Be- 
sides the possibility of selective fac- 
tors in their cross-sectional survey of 
teachers, one might question how 
profound a change is represented by 
their findings. Obviously certain 
physical adjustments are demanded 
of the aged, and surely some goals are 
realized as the person grows older, 
raising others in importance. Perhaps 
Strong’s observation (176) that, in 
general, the things liked at 25 are 
better liked, whereas those disliked 
are disliked more, as one grows older, 
is a more adequate description of age 
changes in interests, goals, or values. 

When college students were com- 
pared with their parents on Allport- 
Vernon scores, Fisher (60) found that 
the parents were farther from neu- 
trality; this, she believes, may be due 
to better organization of the value 
structure with increasing years. 
Daughters were more aesthetic than 
mothers, and sons than fathers; 
mothers were more economic than 
daughters, and fathers than sons. 
Sons were more political and theo- 
retical, less social than fathers. 

When standards for judgment were 
considered, Anderson and Dvorak 
(5) observed that grandparents of col- 
lege students are more likely to em- 
ploy “right-wrong” standards; their 
data further suggest that the college 
students are more likely to make de- 
cisions on the basis of prudence and 
aesthetic consequences. 

The crucial data concerning values 
in the aged and aging will, as in other 
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developmental problems, have to be 
derived from longitudinal studies. 
Questions like “‘Do value structures 
remain relatively the same, or if not, 
are certain patterns more predisposed 
to change?” and “Is biological selec- 
tion operative such that some value 
patterns are more likely to lead to 
longevity?’’ are hardly answerable 
cross-sectionally. 

Summary statement. Values have 
been studied at many points along the 
age continuum, and a few of the sig- 
nificant influences in the formation 
and reorganization of value struc- 
tures have been isolated. Some of the 
research is largely ‘‘suggestive,’’ lack- 
ing the precision desirable for ‘‘cer- 
tainty."" Most investigations are 
cross-sectional, leaving room for 


doubt as to the locus of any obtained 
differences. The need is great for 
longitudinal studies which do not 
lose “‘lifelike complexity’’ when sub- 
jected to adequate control. 


VALUES AS REFLECTED IN 
CoGNITIVE LIFE 

During the past decade psycholo- 
gists have been examining values as 
they function in organizing cognitive 
processes. Effects of this organization 
are supposedly manifest in several 
ways—in a selected, accentuated, 
more easily fixated, less variable re- 
sponse. This section will be devoted 
to these various manifestations. 

Accentuation. Because the search 
for ‘‘first’s’’ is likely to lead to infinite 
regress, or to Plato and Aristotle, 
Ansbacher’s finding (8) that judg- 
ments of quantity are influenced by 
the monetary value of the objects per- 
ceived will be arbitrarily selected as 
the initial attempt to explore system- 
atically the effect of values on per- 
ception. His investigation, it is true, 
was inspired by some of the theories 
and experiments of Brunswik and his 


students (29), but there, although 
value became an experimental vari- 
able, the central problem was ‘‘thing- 
constancy” relationships. 

The action of value in perceptual 
or cognitive responses was further 
explored by Bruner and Goodman 
(26) who report significant differences 
between size estimation of coins and 
gray discs, the former appearing as 
larger. They found, furthermore, 
that generally the greater the value 
of the coin, the more accentuated it 
is likely to be and that coins are likely 
to be estimated as larger by poor than 
by rich children. 

Bruner and Goodman have been 
criticized for failing to rule out other 
possible perceptual differences than 
the one investigated (137). Carter 
and Schooler (36) maintain that ac- 
tual size was not taken into proper 
account and question whether ac- 
centuation operates when the object 
to be judged is physically present and 
clear. They found little difference in 
accuracy between size judgments of 
the smaller coins and discs, but larger 
coins were judged as larger than com- 
parable discs. A tendency for the 
smaller coins to be underestimated 
and the larger overestimated was ob- 
served. Poor and rich children, they 
also found, make essentially the same 
judgments when coins are present, 
though the former are more prone to 
accentuate size when coins are ab- 
sent. 

After methodically investigating 
some of the major procedural differ- 
ences between the Bruner-Goodman 
and the Carter-Schooler experiments, 
Bruner and Rodrigues conclude that 
“the value of objects does not une- 
quivocally affect their judged abso- 
lute size”’ (28, p. 24), but that relative 
accentuation is characteristic of 
valued objects (i.e., more increase in 
overestimation in going from a penny 
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to a quarter than is found for discs 
of comparable sizes). 

Under conditions of immediate 
memory, Dukes and Bevan (51) fur- 
nish evidence which supports the 
theory of increasing accentuation 
with increasing monetary value. 
Data obtained from subjects in 
hypnotically-induced economic states 
(12) are in essential agreement with 
those of Bruner and Goodman, but 
the danger of suggestion uncon- 
sciously extending to include ‘‘good”’ 
results cannot be overlooked. 

Accentuation is also evident when 
the values are symbolic, with both 
positive and negative values leading 
to accentuation (27, 51). 

Much of the value-cognition re- 
search has been criticized for dealing 
with an abstract perceiver, and for 
the artificiality of the experimental 
situations (111). When some of the 
neglected variables—intensity of 


value, figural properties, task-diffi- 


culty—were included, results yielded 
only equivocal support to the hy- 
pothesis that value-tinged discs 
tended to be accentuated. ‘Individ- 
uals’”’ and ‘“‘individuals Xdiscs”’ were 
significant sources of variance under 
all conditions. On the basis of this 
analysis a straight accentuation hy- 
pothesis was deemed overly simple, 
and the individual and how he is con- 
stituted to cope with values consid- 
ered the proper locus for value-cog- 
nition studies (94). 

Selection. The tendency for a 
valued object to be selected percep- 
tually from among a group of per- 
ceptual objects has also been empha- 
sized by Bruner and his colleagues. 
Words representing values ranked 
high by individuals were found to 
have lower visual recognition thresh- 
olds than those ranked lower (144). 
Value orientation, it was theorized, 
operates in a dual fashion: in the role 


of ‘‘sensitizer,"’ lowering thresholds 
for acceptable items, and in the serv- 
ice of perceptual defense, raising 
thresholds for unacceptable stimuli. 

Similar results were obtained with 
direct measurements of value prefer- 
ences (ranking words used as stimu- 
lus objects according to liking) (72) 
and with stimuli presented auditorily 
instead of visually (192). 

Solomon and Howes (171), how- 
ever, maintain that the threshold dif- 
ferences are primarily a function of 
frequency of word usage. When both 
relatively frequently used and rela- 
tively infrequently used words were 
employed as stimuli, no systematic 
variation in recognition threshold 
with value rank was apparent, but 
at every value rank, a threshold dif- 
ference between frequent and infre- 
quent words was discovered. 

When Postman and Schneider 
(145) repeated the Solomon-Howes 
experiment, including a recall test as 
well as a recognition one, they found 
that under the latter condition value 
is not a significant source of variance, 
whereas frequency and the interac- 
tion of frequency and value are. 
Under conditions of recall the picture 
is reversed, value rank being the only 
significant source of variance. The 
reported relationship is U-shaped, 
the most and least significant values 
(high and low on the Allport-Vernon) 
being favored in memory (cf. 27, 51 
in accentuation). These results, indi- 
cating that value operates selectively 
in recall but not in recognition pro- 
vide illustration of the principle that 
the organizing aspects of valued 
stimuli are proportional to the am- 
biguity of the situation (cf. 26, 36). 

The Postman-Schneider results, 
however, seem to be in partial oppo- 
sition to the original Postman- 
Bruner-McGinnies work according 
to which one would expect individ- 
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uals to ‘‘defend themselves perceptu- 
ally,’ and give higher recognition 
thresholds for items more ‘‘unaccept- 
able to consciousness."" Perhaps lo- 
cating the values sampled on each 
individual value continuum (i.e., di- 
rection and relative distance from 
some zero point) might help to resolve 
some of the conflicting results. 

Any attempt to explain away ob- 
served relationships between value 
and perceptual sensitivity on the 
basis of frequency (as in Solomon 
and Howes) seems a little short- 
sighted since the obvious question is 
“Why is one group of words more 
familiar to this person while another 
group is more familiar to that one?’’ 
and an obvious answer is in terms of 
value or interest. 

Rigorous control of both frequency 
and value was effected by Mausner 
and Siegel (122) by assigning various 
monetary values to completely un- 
familiar stimulus objects. After the 


value of each object had been learned, 


identification was tested tachisto- 
scopically; no differences in recog- 
nition time were found between the 
variously-valued stimuli. One may 
wonder about the degree of learning, 
i.e., would positive results have been 
obtained if the values, as in life situa- 
tions, had been greatly overlearned? 
(cf. 8 for accentuation of familiar 
items). Their negative results, never- 
theless, further emphasize the need 
for redefinition of the problem, not in 
terms of “does or does not value 
operate as selector and accentuator?"’ 
but rather in terms of “under what 
cond:tions does it so operate?”’ 

The selective function of values is 
further illustrated by an increased 
readiness to respond to words repre- 
senting valued events (32, 117). 

Fixation. The general relationship 
between amount and quality of re- 
ward on learning performance is well 


known to the learning theorist, speed 
of learning being one of the means of 
measuring motives. On the level of 
human learning this relationship is 
evidenced by the fixation of items in 
areas of greater relative value (16, 54, 
106, 118). 

Fixation as affected by values in 
the practical situation of college edu- 
cation was discussed previously (22). 

Response variability. Other indi- 
cations of the operation of value in 
perceptual judgment are to be found 
in the decreased variability in esti- 
mates of valued objects presented 
visually (21) and kinesthetically (50). 
Vernon's criticism (194) of the for- 
mer, that since the patterns of the 
valued and nonvalued stimuli were 
different, variability differences could 
equally be due to configurational dif- 
ferences, would hardly be applicable 
in the latter instance. 

Social perception. in most of the 
value-cognition research cited so far, 
the stimulus objects have been coins, 
words, discs, etc. Special mention of 
the perception of other persons is not 
intended to imply a different order of 
psychological phenomena. In social 
perception the same psychological 
functions—accentuation, selection, 
etc.—may be illustrated, though per- 
haps the medium is more compli- 
cated. 

To test the hypothesis that “‘indi- 
viduals tend to attribute their own 
values to associates whom they 
choose”’ (147, p. 356), Precker secured 
ratings of a list of items from stu- 
dents and faculty and nominations 
from students of preferred faculty 
advisers. He then requested the 
students to rate the items as they 
thought their chosen adviser would 
rate them. His data support his 
conclusion that ‘‘first, we choose as- 
sociates who demonstrate value-simi- 
larity in one or more areas of be- 
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havior, and second we fill the gap, so 
to speak, by attributing to them 
value-similarity in other areas” (147, 
p. 361). 

Granting that most persons under 
circumstances similar to those in the 
experiment do attribute to their as- 
sociates their own values, one can 
without contradiction contend the 
obverse. That is, in some cases of 
demonstrated value-similarity in cer- 
tain areas of behavior, persons may 
“fill the gap” by attributing value- 
dissimilarity in other areas— 
“sharpen”’ instead of “‘level.”’ A cru- 
cial study would be one that made 
clear when one might expect the 
positive or the negative process. 

The question of negative choices 
may also be raised: how do these same 
students conceive the rating of fac- 
ulty members whom they would 
especially dislike as advisers or those 
toward whom they feel indifferent? 

Some pertinent information on this 


point is provided by Fensterheim and 
Tresselt (57). When subjects consid- 
ered their own high values as best 


descriptive of photographs, these 
were liked significantly better than 
when they considered their own low 
values as best descriptive. Comple- 
mentary results were obtained in the 
case of least descriptive values. Such 
results would suggest that Precker 
would get different projections with 
disliked advisers. 

Contrary to their expectancy, low- 
ranking values were most often pro- 
jected into magazine photographs, a 
result which they explain on the basis 
of “‘strangeness."”” This conjecture, 
of course, raises a multitude of prob- 
lems—defining strangeness, locating a 
point of shift from attributing high- 
ranking values to attributing low- 
ranking ones, etc. One simple pos- 
sibility is that the group of photo- 
graphs used as stimuli were more 


disliked than liked, and consequently, 
in line with their other findings, 
would be described with low-ranking 
values. 

The literature on the theory and 
application of projective techniques, 
though not often couched in a value 
framework, no doubt provides many 
examples of the operation of values in 
one’s perception of the physical and 
social world. 

Summary statement. Research on 
values and cognition has hardly been 
underway long enough to permit 
much long-range appraisal. Some- 
times results support hypotheses con- 
cerning the organizing role of value; 
in other instances data are noncon- 
firmatory; while in still others alter- 
nate hypotheses are equally accept- 
able. That value cam function as an 
organizer in perceiving and remem- 
bering seems well established; in 
view of widely divergent results ob- 
tained from relatively minor changes 
in attitudinal sets, physical condi- 
tions of testing, etc., specifying when 
it operates (other than saying ‘“‘when 
autochthonous factors are at a mini- 
mum’) constitutes a major problem. 

Despite the efforts of a few investi- 
gators, notably Bruner, to maintain a 
well-integrated, comprehensive theo- 
retical framework, observations by 
different experimenters that seem to 
embody the same or a highly similar 
principle have not always been trans- 
lated into a common psychological 
structure, nor have the findings which 
conflict with existing data always 
been adequately assimilated. 

Regardless of any possible incom- 
pleteness or incongruity in theory or 
experiment, or of the interpretation 
given the data which have been 
gathered in this area, their contribu- 
tion to psychology seems rather out- 
standing. Here, the integration of the 
time-honored categories of conation, 
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cognition, and affection is actualized, 
rather than merely being paid lip 
service, and clinician and experi- 
mentalist find problems of mutual 
interest. 


CONCLUDING REMARKS 


Psychologists have devised a num- 
ber of instruments to ‘‘measure”’ 
value, though a variety of potentially 
useful techniques have been largely 
overlooked. They have assiduously 
applied these instruments to prob- 
lems of ‘individual differences, but 
have, for the most part, neglected the 
individual. 


In an attempt to approach the 
problem of value with broader per- 
spective, the psychologist is cooper- 
ating in research with other social 
scientists. He has also begun to 
probe with some success the prob- 
lems of genesis and development of 
values. 

Within recent years the function of 
value in cognition has been under 
scrutiny. The somewhat new area in 
which values are being investigated 
and their operation in social percep- 
tion promises to be a very fertile field 
of research for both pure and applied 
psychologists. 
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The teaching of psychology has 
long been a major task of psycholo- 
gists, but only a few of them have at- 
tempted to treat it as a problem 
worthy of research. Wolfle’s article 
(71) of 1942 reviewed a wide variety 
of references on the teaching of psy- 
chology but refers to less than twenty 
efforts by psychologists to answer the 
many questions by actual scientific 
research. It is probable that one 
reason for such a lack of research is 
that until recent times psychology 
classes were sufficiently small to al- 
low the instructor the feeling of in- 
timacy that is so necessary for an in- 
tuitive feeling of success as a teacher. 
In the postwar era of Big Classes 
such feelings have greatly diminished. 

Today enrollment in the introduc- 
tory psychology class runs from one 
hundred to three thousand per school 
year, depending on the size of the in- 
stitution. Since 1942 there seems to 
have been a substantial increase in 
the amount of research directed at 
answering questions about the teach- 
ing of psychology, with particular 
emphasis on the elementary course. 
It is the purpose of this paper to re- 
view that research. In general, we 
have restricted our coverage to stud- 
ies which have used psychology 
classes. We shall try to answer four 
questions: 


' Published for the Committee on Research 
in the Teaching of Psychology of Division 2 
of the American Psychological Association. 
Clare Young and Everett Bovard of that com- 
mittee gave helpful suggestions. In addition 
the authors are grateful to C. E. Buxton, 
J. F. Hall, C. J. Herrick, D. G. Marquis, and 
Helen Peak for critically reading earlier ver- 
sions of this manuscript. 


1. What are the goals of the teach- 
ing of psychology? 

. What factors affect the achieve- 
ment of the goals of the teaching 
of psychology? 

. What measures are available to 
the researcher? 

. What are the implications for 
further research? 


Course OBJECTIVES 


The objectives of the first course in 
psychology are of importance to the 
researcher because they indicate the 
dependent variables in any experi- 
ment concerned with student learn- 
ing. Most statements of course ob- 
jectives are of a general nature. 
Wolfle (71) recommended three major 
goals (a) to teach psychology, its 


- technical vocabulary, its facts, and 


its principles; (b) to train students in 
the scientific method as it applies to 
the study of human behavior; and (c) 
to make the elementary course useful 
to the individual student by develop- 
ing his ability to understand and to 
cope with the personal adjustment 
problems of daily life. It is probable 
that these are the course objectives 
of the majority of psychology courses 
today. 

In the summer of 1951, the Carne- 
gie and Grant Foundations spon- 
sored an eight weeks conference on 
the undergraduate curriculum in 
psychology. Buxton, Cofer, Gustad, 
McKeachie, MacLeod, and Wolfle, 
the chairman (72), considered at 
length the problems of the first 
course, and concluded that the stu- 
dent taking one or more courses in 
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psychology should gain to some 
degree: 

A. Knowledge 

1. An understanding of the basic 
problems of psychology and an in- 
terest in every aspect of Man. 

2. A knowledge of a good sample 
of the facts and principles of psy- 
chology. 

3. An understanding of the logic 
and methods of science. 

4. An ability to approach problems 
of human behavior scientifically. 

B. Habits of Thought 

1. Skills of observing psychological 
phenomena. 

2. Awareness of the quantitative 
nature of much psychological data. 

3. Recognition of multiple causa- 
tion. 

C. Attitudes and Values 

1. Curiosity about human 
havior and experience. 

2. Respect for his fellow men. 

However, there are those who feel 
that one should take as one’s major 
course objectives such goals as in- 
creasing the student’s self-insight, 
teaching the student new thought 
processes and a: titudes for democrat- 
ic living, helping the student to work 
out minor emotional problems, and 
other more deep-seated changes in 
the student. In contrast to Faw (21), 
Berrien (6, 15), and Asch (2), who 
have developed courses aimed at such 
goals, the Wolfle study group sug- 
gested that direct attack upon atti- 
tudes and personality problems of 
students was likely to be unsuccess- 
ful; and that, in any case, these 
goals should be by-products rather 
than primary aims of the course. 

The articles of Smith (63) and 
Pressey (54) exemplify contrasting 
positions in this regard. Smith re- 
views the nature of the liberal arts 
setting and decides to make his 
primary aim “... to make the stu- 


be- 
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dents grasp the meaning of the scien- 
tific approach to human nature.” He 
also hopes to encourage fact finding, 
since he feels the educated citizen 
should be prepared to deal with the 
droves of expert fact finders about 
him. Finally, he thinks we should 
admit to the students that the weak- 
est area in psychology today is ‘‘Per- 
sonality,”’ in an effort to counteract 
the effects of popular propaganda to 
the contrary. Keller and Schoenfeld 
(42) have described a course which in 
many respects is aimed at the goals 
described by Smith. Pressey, on the 
other hand, advocates teaching the 
students to apply psychological prin- 
ciples to life. It is his contention that 
failure of the psychology department 
to give such a course results in some 
other part of the faculty doing it. 

While this is a value question, the 
crux of the matter is one of estimat- 
ing the degree and nature of change 
in the student that one can produce 
in a one- or two-semester course in 
psychology. This is an empirical 
question, and we shall see shortly 
that the evidence is equivocal. 


Wuat Factors AFrect ATTAINMENT 
oF GOALS? 


Methods of Instruction 


Student-centered teaching. It is not 
surprising that the locus of the ex- 
perimenter’s interest has been the 
classroom. For the past decade the 
chief impetus to research has been 
the controversy about the advan- 
tages or disadvantages of “‘student- 
centered” teaching. To a large extent 
this is an extension of the older con- 
troversy between the proponents of 
the lecture method and the propo- 
nents of the discussion method of 
teaching. 

While a wide variety of teaching 
methods are described by the labels 
“student-centered,” “‘nondirective,” 
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“group-centered,”’ or “democratic,” 
they have in common the desire to 
break away from the traditional in- 
structor-dominated classroom and to 
encourage greater student participa- 
tion and responsibility. In Table 1, 
we have attempted to list some of the 
ways in which the student-centered 
method is supposed to differ from the 
traditional or “‘instruwtor-centered”’ 
class. Now let us review the experi- 
mental attempts to demonstrate its 
effectiveness. 

One of the best known comparisons 
of student-centered and instructor- 
centered instruction is that of Faw 
(21). Faw’s class of 102 students met 
two hours a week to listen to lectures 
and two hours a week in discussion 
groups of 34. One of the discussion 
groups was taught by a student- 
centered method, one by an instruc- 
tor-centered method, and one group 
alternated between the two methods. 

As compared with the instructor- 
centered class, the student-centered 
class was characterized by more 


student participation, no instructor 
correction of inaccurate statements, 
lack of instructor direction, and more 
discussion of ideas related to personal 
experiences, 

Surprisingly enough, Faw’s major 
measure of attainment of objectives 
was in the intellectual area. Scores 
on course examinations showed small 
but significant differences favoring 
the student-centered method. In the 
area of his major interest, emotional 
growth, Faw’s method of evaluation 
was to ask students in the student- 
centered and alternating classes to 
write anonymous comments about 
the class. Generally these comments 
seemed to indicate that the students 
felt that they received greater social 
and emotional value from the stu- 
dent-centered discussion groups than 
they would have from an instructor- 
centered class. 

A very similar experiment was 
carried out by Asch (2). Asch, like 
Faw, taught all the groups involved 
in his experiment. Three sections of 


TABLE 1 


DIMENSIONS UPON Wuica STUDENT-CENTERED AND INSTRUCTOR-CENTERED 
Mernops May Dirrer 





Student-Centered 


Instructor-Centered 





Determined by group (21) 

Emphasis upon affective and attitudinal 
changes (21) 

Attempts to develop group cohesiveness 


(11) 
Classroom Activities 


Much student participation (21) 

Student-student interaction (47) 

Instructor accepts erroneous or irrelevant 
student contributions (21) 

Group decides upon own activities (47) 

Discussion of students’ personal experiences 
encouraged (21) 

De-emphasis of tests and grades (2) 

Reaction reports (2) 


Goals 


Determined by instructor (21) 
Emphasis upon intellectual changes (21) 


No attempt to develop group cohesiveness 


(11) 
Classroom Activities 


Much instructor participation (21) 

Instructor-student interaction (47) 

Instructor corrects, criticizes, or rejects er- 
roneous, or irrelevant student contribu- 
tions (21) 

Instructor determines activities (47) 

Discussion kept on course materials (21) 

Traditional use of tests and grades (2) 

No reaction reports 
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about 30-35 students were taught by 
an instructor-centered method. One 
section of 23 students was taught by 
a nondirective method, quite similar 
to that of Faw. However, there were 
certain differences between Faw’'s 
and Asch’s experiment. In Faw’'s 
experiment both student-centered 
and instructor-centered classes spent 
two hours a week listening to lec- 
tures. In Asch’s experiment, only the 
instructor-centered classes were sub- 
jected to lectures. While Faw doesn't 
mention grading, one assumes that 
grades were determined by the instruc- 
tor on the basis of the course-wide 
examination. In Asch’s experiment 
students in the student-centered class 
were allowed to determine their own 
grades. 

The interesting thing is that Asch's 
results do not completely agree with 
Faw’s. On the final examination in 
the course, students in the instructor- 
centered classes scored significantly 


higher than members of the student- 
centered class, not only on the objec- 
tive portion of the test but also on an 


essay portion. Note, however, that 
the student-centered class was spe- 
cifically told that this examination 
would in no way affect their grades in 
the course. As measured by the Bo- 
gardus Social Distance scale, attitude 
change in the two sections was not 
significantly different. However, it is 
important to note that, as compared 
with the instructor-centered classes, 
a greater percentage of members of 
the student-centered class improved 
in adjustment as measured by the 
MMPI. 

Asch's students, like Faw’s, had a 
different perception of their achieve- 
ment from that shown by the course 
examination. Faw’'s student-centered 
class did better on the course ex- 
amination than the instructor-cen- 
tered class section but thought they 
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would have learned more if they had 
been in an instructor-centered class. 
Asch’s students rated the student- 
centered class higher than the in- 
structor-centered class in helping 
them to learn the subject matter of 
the course, but they actually scored 
lower. Since the perceptions of both 
groups about achievement were in 
error, one wonders about the validity 
of the student impressions of superior 
accomplishments in change of atti- 
tude and development of self-insight 
which are frequently cited as evi- 
dence supporting the student-cen- 
tered method. 

Probably the most comprehensive 
experiment in this area is that of 
Landsman (44). Landsman experi- 
mented with a student-centered type 
of teaching as contrasted with a more 
directed type of democratic discus- 
sion organized around a syllabus. 
His experimental design involved 
eight classes in a course sequence of 
“Human Development,” ‘Adjust- 
ment,”’ and “‘Learning.”’ Three in- 
structors took part in the experi- 
ment, and each instructor used both 
methods. Outcome measures in- 
cluded the Horrock-Troyer tests, a 
local case history analysis test, 
Group Rorschach, MMPI, autobiog- 
raphies, and students’ reactions. His 
results showed no significant differ- 
ences on any of the measures between 
methods. 

A number of other experiments 
have been carried out with negative 
results. Johnson and Smith (40) also 
found no significant differences be- 
tween “democratic’’ and _ lecture 
classes in achievement test scores. 
An interesting side light of their ex- 
periment was that one democratic 
class gave the class extremely favor- 
able evaluations, while the other 
democratic class tended to be less 
satisfied than lecture classes. Bills 
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(8) also found no differences in 
achievement but did find greater 
satisfaction in the students of the 
student-centered class. Eglash (19) 
found no difference between a discus- 
sion class and lecture class, not only 
in scores on the final examination in 
the course, but also in scores on an 
achievement test administered sev- 
eral weeks after the course had ended. 
Husband’s (39) comparison of large 
lecture and small recitation classes 
also failed to find differences in 
achievement between the two groups. 
However, Hirschman (32) found that 
discussion groups of four students 
each learned significantly more than 
a group of about fifty students who 
spent the same length of time in 
getting the same concepts by dicta- 
tion and re-reading. 

Other outcomes of student-centered 
teaching. While scores on objective 
final examinations seem to be little 
affected by teaching method, there 
are indications that student behav- 
ior outside the usual testing situation 
may be influenced in the direction 
of educational goals by student-cen- 
tered teaching. The classes com- 
pared by Bovard (11) and McKeachie 
(47) differed in the degree to which 
interaction between students was en- 
couraged and in the degree to which 
the class made decisions about as- 
signments, examinations, and other 
matters of classroom procedure. Like 
other experimenters, Bovard and 
Mc Keachie found that the groups did 
not differ in achievement as measured 
by the final examination. However, 
clinical psychologists evaluated re- 
cordings of the class discussion which 
followed the showing of the film, 
“The Feeling of Rejection.”’ Both 
clinicians reported that the “group- 
centered"’ class showed much more 
insight and understanding of the 
problems of the girl in the film. 


Gibb and Gibb (25) have reported 
that students who were taught by 
their “participative-action”’ method 
were significantly superior to stu- 
dents taught by traditional lecture- 
discussion methods in role flexibility 
and self-insight. In the participative- 
action method class, activities cen- 
tered around “‘sub-grouping methods 
designed to increase effective group 
participation.”’ ‘The instructor, who 
played a constantly diminishing role 
in the decisions and activities of the 
groups, gave training in role playing, 
group goal setting, problem center- 
ing, distributive leadership, evalua- 
tion of individual performance by 
intra-group ratings, process observ- 
ing, and group selection, evaluation 
and revision of class activities.” An 
especially interesting result was that 
in nonclassroom groups the partici- 
pative-action students were rated 
higher than the other students in 
leadership, likableness, and group- 
membership skills. 

In addition, Sheldon and Lands- 
man (59) report that in ‘Academic 
Methods” a nondirective teaching 
method was superior to a traditional 
lecture-discussion method in aiding 
students to achieve better grades in 
other courses. Smith (60), however, 
found no differences between three 
methods varying in degree of direc- 
tiveness as to their effects upon stu- 
dents’ ability to make “‘applicational 
transfer’’ of their learning. 

Although McKeachie (49) reports 
changes in attitude of students to- 
ward Negroes and toward the treat- 
ment of criminals, differences between 
leader-centered and group-centered 
classes were not significant. How- 
ever, he reports that group decision 
did produce more accurate percep- 
tion of the group norm and more 
conformity to the norm than lecture 
or discussion without decision. While 
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no direct attempt was made to 
change the group norm, the experi- 
ment would suggest that the in- 
structor who wishes to change atti- 
tudes might find the group decision 
technique useful. 

Interaction of teaching method and 
student personality. While this does 
not exhaust the research comparing 
student-centered and instructor-cen- 
tered methods, let us now focus on 
student personality as it relates to 
these methods. 

Wispé (68) carried out an interest- 
ing variation on the student-centered 
versus instructor-centered experi- 
ment. Instead of attempting to con- 
trol the instructor personality vari- 
able by forcing instructors to teach 
both instructor-centered and student- 
centered classes, Wispé selected in- 
structors who were rated as naturally 
permissive or directive and compared 
their sections of the Harvard course 
in “Social Relations."’ He found no 
difference in final examination scores 


between students taught by different 


methods. Students preferred the 
directive method. 

One of Wispé's greatest contribu- 
tions, however, was in demonstrating 
the importance of interactions be- 
tween teaching methods and student 
personality. Wispé used TAT-like 
measures to differentiate three types 
of students: the “personality-inse- 
cure” student (51 per cent of the 
sample), the “‘satisfied’’ student (26 
per cent of the sample), and the 
“independent” student (23 per cent 
of the sample). The “insecure” 
student had unfavorable attitudes 
toward both instructors and fellow 
students, and had particularly un- 
favorable attitudes toward permis- 
sive teaching. The “satisfied’’ stu- 
dent had favorable attitudes toward 
instructors, fellow students, and both 
directive and permissive teaching 


methods. The “independent” stu- 
dent was highly verbal and wanted 
more permissive teaching no matter 
what method his instructor used. 
The independent student had moder- 
ately favorable attitudes toward fel- 
low students and instructors, but was 
likely to direct aggression against the 
instructor in directive classes. 

In conjunction with Wispé’s re- 
sults, a study of factors associated 
with student choice of directive or 
nondirective classes is of interest. In 
their study of Springfield College 
students, Ashmus and Haigh (4) 
found, as contrasted with Wispé, al- 
most equal numbers choosing each 
method. Students choosing non- 
directive classes did not differ from 
students choosing directive classes in 
ACE percentile ranks or academic 
index, but were more likely to have 
had previous experience in nondirec- 
tive groups. 

Instructor-method interaction. Wis- 
pé’s research also raises the question, 
“Is there a significant interaction be- 
tween teacher and method?” This 
was one of the questions which could 
be answered by the factorial design 
involving eight instructors, three 
methods, and twenty-four sections 
reported by Guetzkow, Kelly, and 
McKeachie (31). The three methods 
used in their experiment were: 

1. Highly-structured recitation-drill 
in which teachers asked specific ques- 
tions which the students answered 
orally; 

2. Discussion, in which the in- 
structor attempted to develop general 
group discussion ; 

3. Study-tutorial, in which stu- 
dents worked and consulted with the 
instructor individually. 

While the methods were not sig- 
nificantly different in their effects 
upon social attitudes, they did make 
a small, but significant, difference in 
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scores on the final examination, rank- 
ing in the order: recitation, discus- 
sion, tutorial. While the instructors 
preferred the discussion method, the 
recitation method was preferred by 
students. An interesting by-product 
was the finding that students who 
initially preferred a method did not 
differ significantly in achievement 
from students who preferred other 
teaching methods. And, to answer 
our original question, the teacher- 
method interaction was not signifi- 
cant. 

Aspects of teaching method. Mc- 
Keachie has suggested (48) that re- 
search on the problem of teaching 
method might be more productive if 
specific aspects of student-centered 
or instructor-centered methods were 
experimentally compared. One of the 
few experiments using this approach 
was that of Deutsch (16), who com- 
pared cooperative and competitive 
grading systems. While his results 
indicated that greater friendliness 
and pride in the group were created 
in groups graded cooperatively, these 
groups did not show greater interest 
or achievement than groups graded 
competitively. 

Other teaching methods. In addition 
to the studies cited above, there 
exists a body of research dealing with 
more limited aspects of various teach- 
ing methods or techniques. Most of 
the studies are unique in the particu- 
lar variables used, and hence it is dif- 
ficult to do more than indicate that 
some interest has been shown in the 
particular area and to report the 
findings. The lack of a series of stud- 
ies in these areas makes the results of 
most of the studies more suggestive 
than definitive. 

Lecture. Morrow (51) compared ex- 
perimental modification of attitudes 
by two lecture methods. An aptitude 
questionnaire (three Likert-type at- 


titude scales) was administered be- 
fore and after the lectures, and to 
a control group which did not hear 
the lectures. Two lectures were 
given. One was a moralistic exhorta- 
tion against prejudice accompanied by 
a review of factsabout race differences, 
and the second was a presentation of 
the role of emotions in distorting 
thinking so as to produce prejudice. 
The group hearing the first lecture 
displayed fewer gross factual mis- 
representations after the lecture, and 
the more prejudiced subjects seemed 
to have more anxiety. The secend 
type of lecture had no effect on fac- 
tual scores but seemed to increase 
anxiety in the less prejudiced sub- 
jects. Morrow's general conclusion is 
that verbal propaganda delivered 
from the lecture podium is too im- 
personal to effectively reduce prej- 
udice. 

Films. The great mass of research 
on instructional films is relevant to 
our topic, even though most of it has 
not been concerned with the teaching 
of psychology. While it would be 
impossible for us to summarize all of 
the relevant studies, certain prin- 
ciples have emerged. 

1. Students can learn from films, 
and usually do learn as much as from 
a poor teacher (33). 

2. Such learning is not confined to 
details, but may include concepts and 
attitudes (33). 

3. It is efficient for learning to re- 
peat the film showing (3). 

Tests. Pressey (55) has published 
the results of an extensive program of 
research with tests which students 
score themselves by punching alter- 
natives until they hit the correct one. 
Four types of studies were carried 
out, and Pressey concludes that (a) 
the self-scoring characteristics repre- 
sent a tremendous saving in time; 
(6) test taking is transformed into 
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self-instruction by the immediate 
knowledge of mistakes; (c) supple- 
mental use of the tests improves per- 
formance on regular objective tests; 
and (d) even more automatic self- 
scoring devices can be devised. 

Pressey'’s results agree with ear- 
lier research summarized in Wolfle 
(71) and the experiment of Fitch, 
Drucker, and Norton (23) to the ef- 
fect that frequent quizzes are effec- 
tive and liked by students. This may 
be an explanation of the superiority 
of the recitation method in the study 
of Guetzkow, Kelly, and McKeachie 
to which we have referred previously. 

Teaching methods: summary. In 
1942, Wolfle summarized research up 
to that time by repeating Longstaff's 
statement of 1932: 

The experimental evidence submitted to the 
present time tends to support the general 
conclusion that there is little difference in 
achievement in large and small classes and, 
also, that it makes little difference as to what 


method of presentation of the materials of the 
course is used (71, p. 701). 


The third decade of research has 
not outdated Longstaff’s statement. 
However, recent research does hold 
forth the promise that in the next 
decade we will have a better under- 
standing of the effect of various 
teaching methods upon student learn- 
ing. 


Student Characteristics 


The teacher of psychology recog- 
nizes that no matter what method he 
uses, his success depends to a large 
extent upon the sort of students en- 
rolled in his class. However, since the 
teacher has little control over this 
variability, little research has been 
carried out on student characteristics 
as related to classroom learning in 
psychology. 

Personality. One of the most recent 
and comprehensive studies of this 
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kind was carried out by Gaier (24). 
Using the Rorschach, Gaier deter- 
mined the relationship between cer- 
tain personality characteristics and 
students’ thoughts in small discussion 
groups. He found that anxiety readi- 
ness, as measured by the Rorschach, 
correlated negatively with scores on 
a reading test and a test requiring 
comparisons of familiar and unfamil- 
iar materials. Rigidity predisposition 
correlated positively with score on a 
measure of rote recall and negatively 
with measures requiring handling 
of new concepts and new methods 
of attacking problems. Negativism 
correlated positively with class time 
spent in negative thoughts about 
other people and negative thoughts 
about ideas expressed in class. 

Another recent and comprehensive 
study on personality correlates and 
academic performance is that re- 
ported by Goldberg and Stern (27). 
Here the researchers first set up a 
theoretical model of the personality 
syndrome being studied, created the 
Inventory of Beliefs as the chief pre- 
dictive instrument (a rigidity-flexi- 
bility kind of distinction) for the 
theoretical model, and then obtained 
extensive data over a period of one 
year on an incoming group at the 
University of Chicago. In addition, 
after the initial data had been ana- 
lyzed, an individual was selected who 
seemed closest to the syndrome de- 
scribed; and a case study was made. 
Correlations were found between the 
personality syndromes taken from 
the extremes of the Inventory scores, 
and excellence in placement tests, 
subsequent courses, ability to adjust 
to university life at Chicago, and 
classroom behavior. A comprehen- 
sive report of these findings is now in 
preparation. 

In attempting to predict success in 
elementary psychology, Magaret 
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and Sherriffs (50) developed an eight- 
item test of the ability to apply psy- 
chological principles to problems of 
parent-child relaticnships. Students 
of the elementary course in psychol- 
ogy were giver. an interview and a 
misconceptions test and were divided 
into two groups: those whose beliefs 
were destined to be contested by the 
course content, and those whose be- 
liefs were destined to be supported by 
the course. At the end of the course 
the students in the latter group made 
significantly better scores on the 
parent-child relationships test, al- 
though there were no differences be- 
tween groups on the objective mul- 
tiple-choice test used as the final ex- 
amination. Cases were found where 
the correct answer was given on the 
objective test despite an answer to 
the same item in situation form indi- 
cating contrary belief. The situation 
test was not used to determine 
grades, i.e., under full achievement 
conditions. Unfortunately, unpub- 


lished research by Birney revealed no 
differences in scores on this test be- 
fore and after students had taken a 
course in general psychology. 

Smith (60) suggests that students 
who learn the facts but have diffi- 
culty in applying them may be more 


anxious or emotionally disturbed 
than students who make such trans- 
fer easily. He found that students 
who scored low in “applicational 
transfer’ were characterized by slow 
responses on word association. 

Intelligence. One of the persisting 
practical problems in teaching is 
whether to grade a student on the 
basis of his level of achievement or on 
the basis of his improvement. The 
difference this might make is well 
illustrated by studies on the relation- 
ship of intelligence to achievement in 
psychology courses. 

Carlson, Fischer, and Young (14) 


found no correlation between intel- 
ligence and gains in scores on a fac- 
tual examination given at the be- 
ginning and end of a psychology 
course. Portenier (53) found that the 
Ohio State Psychological Test gave 
an r of +.68 with grades in the 
course. This was as good as the 
multiple R of +.682 obtained from 
a battery containing the OSUPT, 
American Council of Education Ex- 
amination, lowa High School Con- 
tent Examination, and Nelson-Denny 
test. These findings were borne out 
for three successive years even when 
the textbook had been changed. 
Portenier concludes that a critical 
score of 87 on the OSUPT will give 
high confidence of C work or better in 
the Wyoming introductory psychol- 
ogy course. This supports the earlier 
study of Dysinger and Gregory (17), 
who found that both intelligence and 
previous knowledge of psychology 
were important in determining course 
grades. Newman, Duncan, Bell, and 
Bradt (52) found that achievement 
could be predicted not only by an 
intelligence test but also equally well 
by a psychology achievement test and 
by Buxton’s Misconceptions test. 

Maturity. One of the practical 
problems which involves student 
characteristics is whether or not 
freshmen should be admitted to the 
elementary psychology course. Fre- 
quently proponents of the sophomore 
course argue that freshmen are sim- 
ply not mature enough to handle 
psychological concepts. Roy (56), in 
a study comparing grades of fresh- 
men and sophomores in elementary 
psychology, found that the freshmen 
made significantly poorer grades even 
with ability equated. Earlier, Fisher 
(22) and Gilliland (26) obtained simi- 
lar results although in Gilliland’s 
study the differences were not sig- 
nificant. 
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Conclusions. This research on stu- 
dent and teacher characteristics is of 
impurtance, but thus far only intel- 
ligence and the familiar syndrome of 
rigidity-flexibility seem to have been 
well substantiated as factors of real 
interest for the college teacher to 
consider. An area which has been 
neglected is the all important one of 
motivation. Why do some students 
try harder than others? Conceivably 
research could be done with motive 
measures, such as the achievement 
measure developed by McClelland 
et al. (46), since its reported correla- 
tion with intelligence is low. 

The importance of studies of stu- 
dent characteristics lies in the possi- 
bility that educational goals may be 
more easily attained if student char- 
acteristics are considered in the 
choice of teaching method, in course 
sectioning, or in selection of students. 
Research on student characteristics 
also has another value. By demon- 
strating the importance of relatively 
permanent student abilities and per- 
sonality characteristics in determining 
student achievement, this research 
points to the necessity of research de- 
signs removing this source of variance 
when the experimenter is attempting 
to compare the effects of differing 
teaching methods. The use of gains 
scores as criteria, or the use of intelli- 
gence and personality test scores as 
predictor devices, should add much 
to the sensitivity of the experiment- 
er’s tests of the differences his experi- 
mental variables have produced. 


Instructor Personality 


While a large number of investiga- 
tions have attempted to relate per- 
sonality characteristics of elementary 
school teachers to their effectiveness, 
fewer such studies have been done at 
the college level. The results of two 
studies at the elementary school 


level may be of interest to the college 
researcher. Gotham (28) found no 
relationship between scores on several 
standard personality inventories and 
teaching effectiveness. Anderson, 
Brewer, and Reed's (1) study demon- 
strated relations between teacher be- 
havior and the classroom behavior of 
children. Their categories and obser- 
vational measures of “‘dominative”’ 
and “integrative” teacher behavior 
might be a useful basis for research 
at the college level. Many teacher 
personality studies have been done 
by educational psychologists, but as 
yet the unique interaction of teacher 
personality and the subject matter of 
psychology is uncharted. Are some 
personalities better fitted to present 
some areas of psychology than oth- 
ers? If so, what does this imply for 
course objectives, etc.? It would 
seem that some very basic theoretical 
studies of personality could be done 
in this area. 


MEASUREMENT 


It has not been uncommon for 
persons interested in some particular 
teaching method to insist that the 
method is beneficial to the student 
even if the objective test scores do 
not show it. As we have seen, the 
proponents of the nondirective teach- 
ing method are currently faced with 
difficulties in demonstrating experi- 
mentally the assumed benefits of 
their method because few measures 
exist for the kinds of skills they hope 
to develop in the students. Thus, we 
see that measurement problems are 
of central importance. 


Measures of Classroom Process 


In research on teaching methods, 
one of the difficulties has been ade- 
quate description and measurement 
of the independent variable, the 
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teaching method. In a recent article, 
Wispé (69) emphasizes this very 
point. Commenting on the current 
research on teaching methods, he 
makes the point that attainment of a 
classroom condition, such as permis- 
siveness, cannot be taken as the 
dependent variable and cited as 
proof of a method’s effectiveness. 
Rather, it must be treated as proof 
that the independent variable, i.e. a 
permissive method, is actually pres- 
ent, and then an adequate dependent 
variable of teaching effectiveness 
should be provided. The usual tech- 
nique for establishing the presence of 
the independent variable is obser- 
vation of the class by one or more 
observers using a set of categories 
designed to reveal the desired dif- 
ferences between methods. For ex- 
ample, among the categories scored in 
Wispé's experiment were the number 
and kind of student questions, 
amount of student and instructor 
talking, and the ratio of ‘‘We’s’’ to 
“‘I’s’’ used by the discussants. 
Bovard (10) and McKeachie (49) 
developed measures of group co- 
hesiveness for use in the classroom. 
They asked each student to rank on 
10-point scales his liking for each of 
the other members of the class, his 
liking for the group as a whole, and 
the degree to which he felt a member 
of the group. These measures dif- 
ferentiated their ‘“‘group-centered”’ 
and “leader-centered’’ classes. Wit- 
hall (70) has demonstrated a _ tech- 
nique for measuring the “‘social- 
emotional”’ climate of the classroom 
as a function of the teacher's re- 
marks. Typescripts of the teacher 
statements were judged on a con- 
tinuum of learner-supportive to 
teacher-supportive by five judges. 
Unique patterns for individual teach- 
ers were found and were consistent 
over time. Both students and judges 


agreed on the nature of statements 
creating ‘positive’ emotional feel- 
ings. 


Measures of Course Outcomes 


Knowledge. The usual method of 
evaluating the student’s knowledge 
is an objective course examination. 
The quality of this criterion undoubt- 
edly varies a good deal depending 
upon the test construction skills of 
the instructor. While instructors un- 
doubtedly differ a good deal in their 
stress on different areas of content, 
the use of some well-constructed 
standard psychology achievement 
test as one criterion measure would 
probably enable researchers to more 
clearly communicate their results to 
other experimenters. The USAFI 
examination in general psychology 
was formerly available and was one 
of the measures used in the experi- 
ment of Guetzkow, Kelly, and Mec- 
Keachie (31). In addition, there is 
Hogan’s test (34), which, to our 
knowledge, has not been used in any 
research. 

Scientific method. As we have al- 
ready seen, most psychology courses 
have emphasized scientific goals to 
some extent. Yet there has been rela- 
tively little effort to measure attain- 
ment of these goals. Some work in 
this direction has been done by Ed- 
wards (18), who constructed two 
tests for aspects of critical thinking. 
His article is concerned primarily 
with the steps in constructing such 
tests and demonstrates two tech- 
niques which may be used. One test 
presents scientific facts and then de- 
mands that the student evaluate 
them with reference to some scivn- 
tific principle. The other method is 
designed to make the student produce 
a rank order of statements about con- 
troversial data. While his tests are 
designed for grades 9-14, his article 
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should be helpful to the experimenter 
in college teaching. 

The Watson-Glaser Tests of Criti- 
cal Thinking (66) seem well con- 
structed but have not been used ex- 
tensively in research in teaching. 
Smith and Dunbar (62) report that 
their democratic psychology class 
produced changes in scores on the 
“Generalizations” and “Assump- 
tions” parts of their test. Sanford 
and Hemphill (58) report gains on 
the ‘General Logical Reasoning,” 
“Applied Logical Reasoning,’’ and 
“Survey of Opinion’’ subtests for 
Naval Academy students whose 
course included 14 hours of instruc- 
tion in psychology. 

A pplication of principles. Horrocks 
and Troyer (38) have developed and 
used tests of the ability to use psy- 
chological principles on case histories 
and specific topic areas. They de- 


scribe the construction of the tests 
and report reliabilities in the +.70’s, 


and product-moment _intercorrela- 
tions between tests of around +.60. 
The correct responses to the test 
items are those which are chosen by 
experts. These tests correlate with 
the Ohio State Psychological Test 
about +.23. A correlation of +.30 
was found betweeen the case history 
tests and achievement test scores 
(36). Horrocks and Nagy (37) used 
the Case of Connie test to determine 
the effect of taking psychology 
courses and of experience in the field 
of psychology upon the ability to 
diagnose personality problems and 
upon the ability to recommend re- 
medial treatment. They found some 
indication of a relationship between 
these two abilities (+.44) but no 
differences between any of the groups 
tested, and these ranged from college 
freshmen to graduate students in 
psychology and teachers of college 
psychology. The tests developed in 
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this effort have been published and 
reviewed in the Mental Measurements 
Yearbook (12). These tests should 
have value as teaching aids. The 
tests of Magaret and Sherriffs (50) 
and of Smith (60) described earlier, 
and McCandless’s “Attitude Toward 
People’s Behavior’ (45) should also 
be useful in attempts to evaluate 
ability to apply psychological knowl- 
edge to everyday affairs. 

Karn (41) reports that on the File- 
Remmers “How Supervise?’’ test, 
students in a psychology course made 
greater gains than a control group. 
Wickert (67) found significant gains 
in courses in industrial psychology 
and human relations in industry, but 
not in the general psychology course. 
Kelly and Pepitone (43) also report 
gains in ability to apply human rela- 
tions principles to descriptions of 
human relations problems. They 
coded student responses to these 
problems, not only in terms of appli- 
cation of principles, but also for the 
degree to which the student showed 
tolerance and understanding of the 
individuals described. For many in- 
structors such understanding is one 
of the major goals of psychology 
courses. These are among the few 
attempts to measure it. 

Many instructors use misconcep- 
tions tests as pedagogical devices. 
However, such tests might also be 
used as outcome measures since to 
some extent they measure the degree 
to which the student has carried over 
psychological knowledge into his 
everyday thinking. One of the most 
widely used tests of misconceptions is 
that developed at Northwestern by 
Buxton. In a factor analysis of this 
test, Holley and Buxton (35) dis- 
tinguished four factors: knowledge 
about psychological principles, knowl- 
edge of special terminology, con- 
ventional morality, and overevalua- 
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tion of learning ability. Sanford and 
Hemphill (58) devised a “Common 
Sense”’ test, which contained state- 
ments concerning popular beliefs 
about people and their behavior. 
Their Naval Academy students 
showed significant gains on this test. 

Adjustment, attitudes, and person- 
ality. Gross (29) has constructed, 
validated, and used a scale for meas- 
uring self-insight. Working with 
Cantor, he was interested in provid- 
ing a measure of the kind of changes 
that were felt to take place under the 
nondirective methods of teaching. 
In his second report (30), he describes 
use of the Self-Insight scale to demon- 
strate that Cantor’s nondirective 
teaching (13) leads students to a 
greater amount of self-insight. Per- 
centage of change in score from pre- 
course administration to postcourse 
administration was obtained for an 
instructor-centered class in econom- 
ics and for Nathaniel Cantor's class 
in Dynamics of Learning. The larger 
group increase and the greatest indi- 
vidual increases appeared in Cantor's 
class. 

Sanford and Hemphill (58) found 
significant gains on the George Wash- 
ington University Social Intelligence 
Test for Naval Academy students. 

A large number of the commonly 
used tests of social attitudes have 
been used in general psychology 
courses. In general, no remarkable 
changes of attitudes have been 
reported. 

Anyone interested in measuring 
either affective or cognitive change 
would do well to become familiar 
with the work of Smith, Tyler, et ai. 
(61). In a classical research effort 
to devise measures of student prog- 
ress for the Eight Year Study of 
general education, they faced and 
partially resolved some of the basic 
problems of evaluating critical think- 


ing, ability to apply principles, 
changes in attitude, etc. One salient 
point of their work was the conclusion 
that adequate measures of these abili- 
ties most certainly demand more than 
a single score or measure. 

Interest in psychology. One of the 
goals of most courses is to stimulate 
interest in the subject matter. How- 
ever, there have been relatively few 
attempts to measure this outcome. 
Guetzkow, Kelly, and McKeachie 
(31) asked students to check the ad- 
vanced psychology courses they 
would like to take. As we have seen, 
they found differences between stu- 
dents taught by different methods, 
with students taught by a highly- 
structured method planning to elect 
more advanced courses. These dif- 
ferences were reflected to a smaller 
degree in the actual elections of the 
students. The same experimenters 
also successfully used as a criterion 
the number of students deciding to 
become majors in psychology, and 
also developed a test of attitude to- 
ward psychology. It should also be 
possible to get measures of the 
amount of voluntary outside reading 
of psychological materials students 
do, both during a course and after its 
completion. 

Altitudes toward course. Most in- 
structors prefer that their students be 
satisfied. Hence one of the types of 
measures most frequently used is an 
evaluation by students of the course 
and instructor. Since there is volu- 
minous literature on student evalua- 
tion of faculty, no attempt will be 
made to review it systematically here. 
However, two recent studies may be of 
particular interest to the researcher. 

Elliott (20) found that if an in- 
structor is more effective with the 
brighter students, these students will 
rate him higher than the poorer 
students will and vice versa. 
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Russell (57) found that ratings of 
such characteristics as “contribution 
of text to the course,” “fairness of 
examinations,”’ and other aspects of 
the course were more valid indica- 
tions of an instructor’s effectiveness 
than ratings of his “effectiveness 
as a teacher’ or his “friendliness.” 
In both Elliott’s and Russell's stud- 
ies, the criterion measure of teach- 
ing effectiveness was mean student 
achievement. 

Student thinking. In almost all the 
experiments mentioned earlier, the 
experimenter observed with care 
student and instructor behavior in 
the classroom. Normally, however, 
these observations have been con- 
sidered to be measures of the inde- 
pendent variables; achievement 
tests, attitude scales, or other such 
devices have been measures of the 
dependent variable. One might as- 


sume that what a student gains from 
a class will depend to a large extent 
upon the thinking he does during the 


class period. Bloom (9) has de- 
veloped an ingenious device for deter- 
mining what students think about 
while in classes. His method of 
“stimulated recall’’ consists of re- 
cording a lecture or class discussion 
and playing back the recording to the 
students who were in the class. The 
recording is stopped at certain points, 
and the student is asked to tell what 
he was thinking at that point in the dis- 
cussion. Bloom has used this measure 
in comparisons of the thought proc- 
esses elicited by lectures and discus- 
sions. Preliminary results (5) suggest 
that a greater percentage of student 
thinking relevant to the topic occurs 
in lecture than in discussion, but that 
more problem-solving thought occurs 
in discussion classes. 

A somewhat less elaborate measure 
is the students’ note taking. Billings- 
lea and Bloom (7) used this measure 


in an experiment in which no differ- 
ences in note taking were found be- 
tween students arbitrarily assigned 
failing test grades as compared with 
those assigned good grades. 


Measurement Summary 


While our measurement techniques 
are still lacking in many character- 
istics, it is apparent that the present- 
day researcher has available a much 
more adequate arsenal from which to 
choose than the researcher of an earlier 
day, as indicated in Table 2. Un- 
fortunately, many researchers do not 
seem to be aware of the experiences 
of their predecessors and colleagues, 
and laboriously construct homemade 
instruments. Such new tools are often 
valuable, but frequently good results 
could have been attained with much 
less expenditure of effort if the re- 
searcher had known of previous at- 
tempts to achieve the same results. 


Wuat Is NEEDED? 


The dearth of clear-cut findings in 
the research herein reviewed can be 
blamed on many things—the diff- 
culty of controlling variables, the 
lack of adequate measures, or the 
problems besetting an experimenter 
whose primary responsibility is the 
education of the students who are his 
subjects. But, to us, there is one 
factor which is even more important 
than these—the lack of a theoretical 
basis for the research. 

All too often research in teaching has 
begun with the attitude, ‘‘Wouldn’t 
it be nice to know if . . . ?’’ Curiosity 
is a laudable motive; but unless the 
hypotheses and experiment devised 
to answer the question are related in 
some way to general psychological 
theory, substantial advancement of 
knowledge will continue to be slow. 
Only from a systematic theoretical 
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TABLE 2 
OvuTcoME MEASURES AVAILABLE FOR RESEARCH IN THE TEACHING OF PsYCHOLOGY 








Objective Measure Used by 





A. Knowledge 
1. Knowledge of facts and 
principles 


Guetzkow, Kelly & 
McKeachie (31) 


USAFI examination in general psy- 
chology (65) 

Hogan Psychology test (34) 

Troyer-Horrocks Test of Knowledge 
& Principle in Human Growth & 
Development (64) 

Teacher made measures 

Watson-Glaser tests of critical think- 
ing (66) 


Landsman (44) 


2. Understanding the logic Smith & Dunbar (62) 

and methods of science 

Sanford & Hemphill 
(58) 

Horrocks (36) 

Horrocks & Nagy (37) 


3. Ability to apply psy- Horrocks-Troyer (38) 

chological principles to 

“real life’’ 

Guetzkow, Kelly & 
McKeachie (31) 

Magaret-Sherriffs (50) 

Karn (41) 

Holley & Buxton (35) 

Sanford & Hemphill 
(58) 

Smith (60) 


McCandless Analytic Attitude To- 
ward Behavior (45) 

Magaret-Sherriffs 

File-Remmers ‘How Supervise” 

Buxton Misconceptions Test 

Common Sense Test 


Applicational Transfer Test 

B. Attitudes, values, adjust- 
ment 

1. Curiosity about human 

behavior and experience 


Guetzkow, Kelly & 
McKeachie (31) 
Guetzkow, Kelly & 
McKeachie (31) 
McKeachie (49) 
McKeachie (49) 


No. of further courses in psychology 
elected 

No. of students entering concentra- 
tion 

Likert-Attitude Toward Negro 

Wang-Thurstone Attitude Toward 
Treatment of Criminals 

Gross S-I scale (29) 

George Washington U. Social Intel- 
ligence Test 

MMPI 


2. Less punitive attitudes 
toward people 


Gross (30) 

Sanford & Hemphill 
(58) 

Asch (2) 

Landsman (44) 

Landsman (44) 


3. Self-insight 
4. Social adjustment 


Group Rorschach 





approach will we get the kind of test- 
able hypotheses which contribute to 
greater understanding. 

If the experimenter took time to 
try to analyze the variables involved 
in the experiment he is planning, he 
might often find that some of the 
variables involved could be manipu- 
lated more easily in an experiment 
lasting only one class period than in 
an experiment involving numerous 


variables manipulated over the course 
of a semester. 

More careful analysis of variables 
and outcomes might also help the 
instructor to see the advantages in 
many cases of using gains as criteria 
rather than scores on a posttest. 
When so much of the variance in 
students’ achievement depends upon 
factors beyond the instructor's con- 
trol (notably the students’ intelli- 
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gence and general achievement moti- 
vation), removing variance due to 
these factors by predictor measures 
obviously increases greatly the prob- 
ability of locating outcome differ- 
ences between methods where they 
do exist. 

We do not, however, wish to end 
this review on a pessimistic note. 


WILBERT MCKEACHIE 


The need for dependable knowledge 
about college teaching is as great as 
ever. Many of the initial barriers 
have been overcome. With more ade- 
quate theory, increased empirical 
background, and improved measure- 
ment tools, the researcher of the next 
decade can walk where previous re- 
search in teaching has slowly crept. 
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ROTARY PURSUIT APPARATUS: I. SURVEY OF VARIABLES 


R. B. AMMONS 
University of Louisville 


In the early stages of scientific in- 
vestigation of any area of interest, 
the greater part of the energy spent 
is directed toward discovering on a 
gross level the important variables 
and their effects. After this explora- 
tion is well under way, it becomes in- 
creasingly necessary to develop a 
standardized methodology, including 
apparatus. Melton’s excellent review 
(29) of the functions of a methodol- 
ogy and criteria for evaluating ex- 
perimental methods in the study of 
human learning and retention called 
attention to this need. 

The precise standardization of 
scientific apparatus is advantageous 
or even necessary from many points 
of view: 

1. Without it there can be no ade- 
quate systematizing or theorizing. 
Unless there is careful standardiza- 
tion, it is necessary, at least in prin- 
ciple, to repeat all experiments on and 
adapt each theory for each piece of 
apparatus, in order to eliminate ap- 
paratus variables as a factor. Even 
this is not satisfactory, since a 
method would still have to be de- 
veloped to account for differences in 
results among pieces of apparatus in 
order to achieve any sort of general- 
ity. 

2. Absolute prediction for a sin- 
gle piece of apparatus is not possi- 
ble unless standardization has been 
achieved. One must be able reliably 
to calibrate one’s equipment in order 
to obtain “the same” results using 
“the same”’ experimental procedures. 
Without such calibration, apparatus 
variables cannot be held ‘‘constant,” 
and prediction in absolute units will 
not be possible. 


3. Generality of results and pre- 
dictions cannot be assured until 
units can be set up, which are based 
upon measurement operations possi- 
ble for different pieces of apparatus in 
different locations. A big step can be 
taken in this direction by using equip- 
ment which is uniform except where 
some characteristic is deliberately 
being varied. Thus, it is undesirable 
to use both brass and silver targets in 
rotary pursuit, since no experimental 
purpose is being served. 

4. Minimizing apparatus differ- 
ences will aid in the determination of 
the reasons for differences in out- 
comes of similar experiments con- 
ducted at different times in different 
laboratories. Specification of the 


“exact” nature of the apparatus may 


make it possible to narrow the search 
for variables somewhat and to con- 
centrate on procedure and subject 
variables, 


5. To design an experiment effici- 
ently, one should know what appa- 
ratus elements and conditions will 
lead to the most reliable and “‘sensi- 
tive’’ measurements, and also one 
should know what interactions be- 
tween apparatus variables must be 
controlled. If units can be “‘trans- 
lated,’’ one can choose the task vari- 
ant most likely to reveal a certain 
difference without depending on 
guesswork. 


6. As one seeks to standardize ap- 
paratus, one becomes aware of neg- 
lected variables. Once the experi- 
menter has obtained his apparatus 
and persuaded it to work “‘satisfac- 
torily,”” he tends to minimize or 
forget the fact that it and conse- 
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quently his results are unique in 
certain ways which should be taken 
into consideration in interpreting and 
in reporting. A careful survey of 
apparatus characteristics is likely to 
reveal important, potentially or ac- 
tually uncontrolled variables. 

A consideration of the literature on 
the rotary pursuit task indicates that 
we have now reached the point where 
a greater degree of apparatus stand- 
ardization may well be considered. 
This apparatus has been widely used 
in the study of learning variables; yet 
no standard construction has been 
agreed upon; and there have been 
numerous instances of disagreement 
of results, apparently due to appara- 
tus variables. In particular, the levels 
of performance reached after given 
durations of practice and the shapes 
of the acquisition curves seem to vary 
among the various pieces of appara- 
tus. 

It is not the purpose of this review 
to criticize particular investigations, 
but, rather, to evaluate and sum- 
marize the reporting of apparatus 
variables. On this basis, a list of 18 
apparatus variables was set up and 
specification of these variables was 
tabulated for each of 28 “experi- 
mental’ articles (1, 2, 4, 5, 6, 7, 8, 9, 
10, 11, 12, 13, 14, 16, 18, 19, 20, 21, 
22, 23, 24, 25, 26, 28, 31, 34, 35, 38). 
Where numerous studies were carried 
out with the “same’’ apparatus, only 
representative work was included. 
The results of the tabulation will be 
reported, however, without mention- 
ing specific studies, since mentioning 
them would accomplish nothing as 
far as the present paper is con- 
cerned.' 

It is proposed in the following sec- 


'The tabulation can be made available 
privately to anyone who needs it for research 
purposes. 


tion to take up each of the apparatus 
variables for which the literature was 
searched, identify it, report the fre- 
quency and adequacy of its specifica- 
tion in the 28 experimental studies, 
and, where information is available, 
indicate the effects of its uncon- 
trolled variation. Certain rules were 
set up for deciding whether or not 
credit should be given to a particular 
article for specifying a variable: (a) 
“Koerth-type” or ‘‘similar to 
Koerth”’ were considered to indicate 
nothing, unless specific reference was 
made to an article which included a 
more ‘exact description. (b) Where 
direct reference was made to one of 
the “apparatus” articles (3, 15, 27, 
30, 36, 37, 39), credit was given for 
each item specified in that paper. (c) 
Where two of the apparatus articles 
were directly referred to (always 27, 
36), credit was given where only one 
specified the item or where both 
agreed in their specification. (d) No 
credit was given for reference to un- 
published material. 

In reading the following section it 
should be kept clearly in mind that 
the tabulation included only ‘‘experi- 
mental’ articles. Where ‘‘apparatus”’ 
articles are mentioned, they are 
identified as such. Melton’s chapter 
(30) dealing with the pursuit rotor 
was classified as ‘‘apparatus’’ rather 
than “experimental,’’ and so was not 
included in the tabulation, although 
it is frequently referred to in the 
discussion. 


APPARATUS VARIABLES 


Target size. Of the 28 experimental 
articles surveyed, 17 clearly specify 
the size of target used. Five refer to 
other articles in which it is specified. 
The largest size used was 1 in. in 
diameter, the smallest was 34 in., 
while the commonest was } in. (1.9 
cm.). Helmick (19) found that on- 
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target performance was increasingly 
and markedly better the larger the 
the target, going from 4 in., through 
2 in., to 1 in. 

Direction of target rotation. Twenty 
experimental articles indicate that 
the target rotated in a clockwise 
direction. In only eight cases does 
the author actually say that the 
direction was clockwise; in the other 
12 mention is made of using a phono- 
graph motor or some other specified 
motor. 

Rate of target rotation. In 24 articles 
the rate of rotation is given, and in 
two more specific reference is made 
to articles in which it is given. The 
most common rate used is 60 revolu- 
tions per minute or one revolution 
per second, but there is a range from 
32 to 80. The importance of this 
variable is indicated by the fact that 
Helmick (19) found large differences 
in performance level between groups 
practicing at 50, 60, or 80 revolutions 
per minute. 

Check on rate of rotation of target. 
It is one thing to assume that a target 
is rotating at a certain rate, and an- 
other thing to check the rate. That 
this check is necessary is indicated by 
Ruch’s finding (35) that it took his 
turntable up to fifteen minutes to 
reach a constant rate of 32 revolu- 
tions per minute. Melton (30) re- 
ports that variable-speed, governor- 
controlled motors became unreliable 
after several months, and that many 
constant-speed motors had to be re- 
jected because they differed too much 
from 60 revolutions per minute. Rate 
can easily be checked by counting 
with a stop watch or by using a neon 
light and stroboscopic disc as men- 
tioned by Melton (30). In only two 
experimental articles is mention made 
of checking rate at regular intervals. 
Doubtless other investigators did 
check, but no description is given of 


the procedure or of the outcome. 

Time for turntable to reach full 
speed and to stop. In some experi- 
ments motors were allowed to warm 
up and reach a constant speed before 
trials started and to slow down only 
after the trial was completed. In 
others, the motor started at the same 
time the trial started. In only one 
experimental article is time to reach 
full speed specified, and in no case is 
coasting time indicated. In the 
“prod” rotary pursuit task, these 
times become very important, since 
the turntable slows to a stop when- 
ever the stylus loses contact with the 
target. Melton (30) reports much 
greater differences between prod pur- 
suit units than between continuous 
pursuit units, and suggests that a 
part of the difference might well be 
due to differences in coasting time. 
Where intervals to reach full speed 
and to slow to a stop are not included 
in the trial, measurement and re- 
porting of them is probably not im- 
portant. 

Target distance from center of rota- 
tion. In only six articles is the center- 
of-target to center-of-rotation dis- 
tance given. In five other articles 
specific reference is made to other 
articles in which this distance is 
given. In several articles a measure- 
ment is given, but it is not clear 
whether it is a center-to-edge or 
center-to-center dimension. The 
range in center-to-center distances is 
from 3} in. to about 5 in., with the 
most common being 34 in. (7.8 cm.). 
It is quite probable that this distance 
is an important variable in rotary 
pursuit performance and it certainly 
should be given in descriptions of 
apparatus. 

Target material. In 19 articles the 
material out of which the target 
was made is specified. In two others, 
direct reference is made to other 
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articles which provide this informa- 
tion. Brass was the most commonly 
used, although four experimenters 
report using silver. 

Target surfacing. The target face 
tends to become dirty, pitted, and 
scratched if it is not given special 
care. In only five articles is mention 
made of cleaning or polishing the 
target and in only two of these are 
cleaning materials noted along with a 
fixed routine for doing the cleaning 
(every 10 minutes and every 125 
seconds of operation). Melton (30) 
states that an effort was made to 
minimize the oxidizing of target in 
designing the control units of Air 
Force pursuit rotors. 

Target-turntable articulation. It is 
nearly impossible to set the target in 
the turntable in a way which avoids 
any detectable irregularity at the 
target-turntable junction. If this 
break is small enough, the stylus 
movements will not be impeded by 
it. No information is given in any 
article on this point, although many 
state that the target is “flush with the 
surface of the turntable.” Spiegel 
and Crites (37) mention the problem, 
but do not suggest a measure. 

Stylus handle: size, shape, and ma- 
terial. In none of the experimental 
articles are the size, shape, and mate- 
rial of the stylus handle all reported. 
Such information is given in four ap- 
paratus articles (3, 27, 30, 36), and 
five of the experimental articles refer 
to one of these. In two experimental 
articles pictures are shown. Several 
experimenters are known to have 
used no finger guard between the han- 
dle and the hinged arm, and nothing 
is said about this guard in 26 of the 
experimental articles. The handle 
material is specified in six experi- 
mental articles, and in every case it 
is wood. In the three articles in which 
shape is given, it is round. Diameter 


varies from } in. to 1 in., while length 
varies from 10 cm. to 64 in., including 
the finger guard. It can be seen from 
this that there is little uniformity. 

Stylus arm: size, shape, and ma- 
terial. In only four of the experi- 
mental articles is complete informa- 
tion included. However, it is given in 
four of the apparatus articles (3, 27, 
30, 36); and five experimental arti- 
cles refer to one or another of these. 
Where material is specified (nine 
experimental articles), it is steel, cop- 
per, or, most often, brass. In the five 
experimental articles in which diam- 
eter is given, the most common size 
is } in., but it varies to 1/16 in. and 
12, 16, and 30 gauge. Total length 
of the arm is specified in five articles, 
and ranges from 6} to 7} in. The arm 
is bent at an angle of 80° or 90° from 
# to 12 in. from the end. Here again, 
information is seldom given, but 
where it is given, wide variability is 
indicated. 

Shape and composition of stylus 
tip. Since the subject’s score depends 
on the maintenance of electrical con- 
tact between stylus and target, the 
shape and composition of the stylus 
tip are likely to be of some impor- 
tance. Melton (30) remarks that 
silver tips on Air Force styluses 
started out flat, but were replaced 
about every 500 subjects because 
they tended to become elliptical with 
wear. In only four experimental 
studies is the stylus tip material 
specified. In all four it is silver. In 
only one experimental article is the 
initial shape of the tip mentioned, and 
in no article, experimental or appara- 
tus, is a procedure outlined for keep- 
ing the tip shape relatively constant. 

Weight of stylus tip on turntable. In 
only one experimental article is this 
weight specified (0.6 oz.). In four 
others the weight is reported as no 
more than that produced by the 
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weight of the hinged arm. In no 
case is provision made other than by 
instruction to prevent the subject 
from exerting more pressure by bend- 
ing the handle, turning the tip at an 
angle other than 90 degrees, or actual- 
ly putting a finger on the hinged sec- 
tion. The task becomes markedly 
easier in each case. Melton (30) 
states that pressure is affected by 
uncontrolled friction in the hinge and 
that Air Force styluses were designed 
with a spring in the handle to be ad- 
justed so that the pressure can be 
maintained at 22 gm. 

Total weight of stylus. Presumably 
the total weight of the stylus would 
be a factor in the rate of increase of 
fatigue during practice and the level 
reached. This weight is not given 
in any experimental article here re- 
viewed. It is given in only one of 
the apparatus articles, and is 4.2 
oz. 

Size of turntable. As has been 
noted by most experimenters with 
the pursuit rotor, the subject’s score 
is often markedly decreased if his 
stylus tip slips off the turntable. In 
part, the frequency of these slips 
will depend on the size of the turn- 
table. Disc diameter is specified in 
11 of the experimental articles, and 
in 5 others direct reference is made 
to articles in which it is specified. 
In one case a picture is shown. Diam- 
eter varies from 10} in. to 16 in., with 
27.5 cm. being the most common. 

Turntable material and surfacing. 
The friction between stylus tip and 
turntable probably determines to 
some extent the rate of corrective 
movements and hence the mean time 
off target. This friction depends 
upon the material out of which the 
turntable is constructed and how it is 
surfaced. Material is specified in 


nine experimental articles and in two 
experimental 


other articles direct 





reference is made to articles in which 
this information is given. The most 
common material (four experimental 
articles) is bakelite, but hard rubber, 
Prestwood, black vinylite, and wood 
are also used. There is some question 
about these designations, since the 
same material is described both as 
hard rubber and as bakelite in two 
different articles reporting work with 
the same piece of apparatus. Melton 
(30) mentions pitting of the surface 
near the target as a problem and 
states that it was minimized in Air 
Force equipment by the design of 
the control unit. In no other article 
surveyed is any mention made of 
surfacing. 

Scoring unit. Scoring is accom- 
plished in all the pursuit rotor studies 
mentioned here by recording the pro- 
portion of total possible time during 
which electrical contact is made be- 
tween stylus and target. No experi- 
mental article specifies the compo- 
nents of the scoring unit sufficiently 
exactly to allow reproduction; how- 
ever, four apparatus. articles (15, 
27, 30, 36) do. Standard Electric 
timers are most often used, although 
several articles report the use of 
counters. In only one experimental 
article and two apparatus reports is 
the stylus-target circuit voltage speci- 
fied. Melton (30) summarizes Air 
Force practice: 110 volts AC are put 
through stylus and target and stepped 
down by a transformer to operate 
6 volt AC timers, relays are elimi- 
nated as a possible source of variance, 
and clocks are discarded when they 
are observed to be seriously in error. 
Calibration of clocks and relays is 
not attempted. 

Height of turntable surface. This is 
not mentioned in any apparatus arti- 
cle, and is specified in only one experi- 
mental article where it is given as 
“about 70 cm.” from the floor. Since 
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time and motion studies have shown 
work surface height to be a signifi- 
cant variable affecting performance, 
it should probably be reported. 


DISCUSSION 


Of the 18 variables surveyed, the 
two most thorough experimental 
write-ups specified 9 variables direct- 
ly or by direct reference to appara- 
tus articles. ‘The two least adequate 
write-ups specified none. Melton’s 
write-up (30) of the Air Force pur- 
suit rotor specifies 11 variables. 
Several articles were excluded from 
the present tabulation and review 
because no direct mention was made 
of any of the variables, although 
direct reference was made to Melton's 
article. If these articles (17, 32, 33) 
had been included in the tabulation, 
the average number of variables 
specified would have been raised. 
In practice, many of the variables not 
mentioned at all by the various ex- 
perimenters must have been con- 
trolled but simply not mentioned be- 
cause the experimenter thought the 
procedure was so obvious. 

Several careful descriptions of the 
whole pursuit rotor (27, 30, 36) or of 
components (3, 15, 37) have appeared 
in the literature. None of these gives 
complete specifications according to 
the present criteria. Furthermore, 
one can conclude that there is no 
“standard Koerth-type pursuit ro- 
tor,’ since the various pieces of ap- 
paratus vary in important respects. 
Several studies have been conducted 
using each piece of apparatus, and 
the tendency is to refer to the original 
write-up rather than to repeat de- 
tails. Unfortunately, it is probably 
true that in no such case is the 
apparatus used the same as that 
earlier described, since modifications 
are constantly being made. Caution 
is indicated in referring to prior ap- 


paratus articles. The assertion that 
one’s apparatus is standard is not 
enough. 

Serious consideration must be giv- 
en to the problem of adequate de- 
scription of rotary pursuit apparatus. 
The present tendency is to shorten 
articles in order to conserve space. 
As long as there are apparatus vari- 
ables whose importance has not been 
assessed experimentally, their han- 
dling should be specified in the write- 
up of each experiment. Several ways 
of handling this problem seem feasi- 
ble: (a) longer apparatus section; (b) 
careful descriptions of apparatus as 
separate articles, with attention to 
modifications in particular studies; 
(c) deposit of comprehensive descrip- 
tions with the American Documenta- 
tion Institute; or (d) mimeographed 
comprehensive descriptions by each 
experimenter. This last seems the 
least adequate way out, but some 
way should be found. 


SUMMARY AND CONCLUSIONS 


The standardization of experimen- 
tal apparatus with respect to materi- 
al, design, and calibration is necessary 
if efficient experimentation, absolute 
quantification, and adequate theory 


building are to be accomplished. 
We are now at the point in rotary 
pursuit experimentation where such 
standardization could be achieved. 
The least that can be done is to speci- 
fy how important apparatus variables 
are handled in each study. A survey 
was made of the handling of 18 varia- 
bles in each of 28 rotary pursuit 
studies. At the most, the handling 
of 9 variables was adequately de- 
scribed, while two articles failed to 
specify handling of any of the varia- 
bles. Since some of the variables 
have already been shown to have a 
significant influence on performance 
and others may well have, it can be 
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concluded that the handling of all 
such variables should be specified 


in some way in each report of research 
with rotary pursuit. 
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Gaier and Lee’s recent review of 
current developments in pattern 
analysis reflects the current and 
growing emphasis upon techniques 
for the description and evaluation 
of multidimensional data. The Cron- 
bach and Gleser index of dissimilarity 
(D.,) (1) seems to us a most promising 
measure in that it retains a maximum 
amount of the information inherent 
in the original sets of scores. How- 
ever, Gaier and Lee, in referring to 
D., claim that “... this coefficient 
has a known distribution and thus 
can be used for testing the significance 
of obtained differences between 
groups or individuals” (3, p. 143). 
Neither in the Gaier and Lee article 
nor in the original Cronbach and 


Gleser report is the basis for this 


statement apparent. <A_ thorough 
examination of the latter discloses 
no reference to a known distribution 
for the general case of the D,, statis- 
tic. Indeed, the D., index was explic- 
itly developed to serve as a descrip- 
tive measure, not as an inferential 
statistic. Furthermore, Osgood and 
Suci, to whom Gaier and Lee refer 
in their discussion of D.,, conclude 
that “we have not as yet developed 
adequate methods for determining 
the significance of differences between 
distances” (4, p. 261). In the light of 
this, it seems clearly apparent to us 
that the Gaier and Lee exposition 
of D., is incorrect and quite mis- 
leading. Since in their subsequent dis- 
cussion of D,,, Cronbach and Gleser 
(2) do not call attention to this error, 
the present note should serve to 
emphasize the noninferential nature 


of D,,, and thereby prevent any mis- 
understanding which may arise from 
the Gaier and Lee statement. 

It is true, however, that there 
does exist a statistic which, under 
certain limiting conditions, is reduci- 
ble to the D., index and which pos- 
sesses a known distribution. This is 
the Mahalanobis generalized distance 
measure (D*,,), which was developed 
to deal with differences between 
group means. Cronbach and Gleser 
do discuss the relationship of D,, to 
their own D,,. In this discussion, 
Cronbach and Gleser point out that, 
when the variates of a score set are 
orthogonal and standardized, D,, can 
be shown to be equivalent to D,,.! 

When these conditions obtain, the 
D.. formula can be used for the 
computation of D,. When more than 
a few variates are involved, this 
algebraic equivalence considerably 
expedites the computational opera- 
tions. However, it must be stressed 
that this equivalence is only algebraic 
and does not imply a reciprocity of 
statistical properties for the two 
measures. Inferences from a D,, value 
calculated by the D,, formula, based 
upon the known distribution of D,,, 
are legitimate only insofar as the 
underlying data meet the assumptions 
which govern the use of D,,. What is 
important here is that D,,, as an in- 
ferential statistic, is only appropriate 


‘The independence of the variates may 
either be demonstrated by the absence of 
significant intercorrelations among these same 
variates in a comparable reference group or it 
may be insured by a transformation of the 
score to the orthogonal axes. 
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to certain kinds of group data. D.,, a highly useful descriptive index of 
on the other hand, being a descrip- dissimilarity, is not intended to serve 
tive statistic, does not allow inferences as an inferential statistic. To our 
to be made regarding the significance knowledge, the problem of the sta- 
of distances between individuals or tistical evaluation of the difference 
groups. between individual profiles, while as- 

Therefore, contrary to the claim  siduously besieged, remains uncon- 
of Gaier and Lee, D.,, while providing quered. 
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BOOK REVIEWS 


Witkin, H. A., Lewis, H. B., 
HERTZMAN, M., MACHOVER, K., 
MEISSNER, P. B., & WaAPNER, S. 
Personality through perception. 
New York: Harper, 1954. Pp. 
xxvi+571. $7.50. 


The promise held out in the title 
of this book has not been fulfilled. If 
the study of perception is, indeed, the 
royal road to the understanding of 
personality, this road is still beset 
with many dangers for the unwary or 
overenthusiastic investigator: analy- 
sis by metaphor, far-fetched analo- 
gies, circular reasoning, ad hoc and 
post hoc arguments. In a peculiarly 
negative sense, this book makes an 
important contribution: it brings 


out in bold relief the methodological 
and theoretical weaknesses of the in- 
temperate “‘clinicizing’’ of behavior. 
For readers who do not share the 
authors’ biases and preconceptions it 


will be a difficult and bewildering 
book. As one reads on, it becomes 
increasingly disturbing to move 
through a looking-glass world in 
which small variations in psychophy- 
sical judgment loom as sinister symp- 
toms of mental conflict and sopho- 
mores partial to visual cues in space 
orientation are seen as teetering on 
the brink of a breakdown. Such is 
the implausible and somber picture 
which emerges when concepts rooted 
in psychopathology are uncritically 
applied to the analysis of perception. 
If such were the facts, we should have 
to accept them whether or not they 
fit our prejudices. As we shall try to 
show, the evidence offered by the 
authors is doubtful and the logic of 
inference is often faulty and forced. 

The major purpose of the book is 
to relate individual differences in per- 
ceptual functioning to significant 


dimensions of personality. The basic 
research hypothesis is that variations 
in the mode of perception are related 
to central traits of personality. The 
perceptual responses of the individual 
reflect these central traits just as does 
his behavior in other situations, e.g., 
his performance on projective tests, 
his reactions to an interview, etc. 
This hypothesis is to be supported by 
the pattern of intercorrelations be- 
tween a battery of perceptual tests on 
the one hand and an array of per- 
sonality tests on the other. 

The perceptual function chosen for 
investigation is space orientation, as 
measured by the perception of the 
upright. The discrimination of the 
upright is governed by two major 
classes of cues: (@) visual cues, spe- 
cifically the axes of the visual frame- 
work, and (6) postural cues, i.e., sen- 
sations arising in the observer's body. 
A series of experiments by Witkin 
and Asch had shown that individuals 
differ widely in their relative reliance 
on the two classes of cues. The pres- 
ent book is entirely devoted to the 
investigation of these individual dif- 
ferences. The study was concerned 
with three major problems: (a) the 
consistency with which individuals 
relied on visual and postural cues in 
a variety of perceptual situations; ()) 
genetic developments in space orien- 
tation, and (c) the relation of individ- 
ual differences in space orientation to 
personality differences. 

A standard battery of space- 
orientation tests was administered to 
all subjects. In these tests, visual and 
postural cues were set in opposition 
to each other, and two types of judg- 
ments were required of the subjects: 
(a) judgments of the position of an 
item within the field, e.g., adjustment 


79 





80 BOOK REVIEWS 


of a rod or of the subject’s own body 
to the upright, and (6) judgments of 
the position of the total field, e.g., 
adjustment of a tilted room to the 
upright position. A series of addi- 
tional tests was designed to deter- 
mine the subject’s ability to separate 
an item from the surrounding field 
and to make effective use of postural 
cues. A great deal of ingenuity and 
skill was shown in the construction of 
the test situations; precise and reli- 
able measurements were obtained un- 
der well-controlled conditions of ad- 
ministration. 

The general results of the per- 
ceptual tests may be briefly summar- 
ized as follows: (a) Perception of the 
upright depends on both visual and 
postural cues. (b) The relative effec- 
tiveness of the two types of cues de- 
pends on the nature of the task; ad- 
herence to visual cues is more marked 
for judgments of the field as a whole 
than for judgments of an item within 
the field. (c) Perception of the posi- 
tion of the body is less influenced by 
visual factors than is the perception 
of external stimulus objects. 

As expected, individual differences 
were marked, covering the entire 
range from ccmplete dependence on 
visual cues to equally complete ad- 
herence to postural cues. Reliance on 
visual cues is regarded as ‘‘passive ac- 
ceptance of the field,”” whereas re- 
sponse to bodily cues is considered to 
be ‘active’ and “analytic.”” The 
logic of this distinction between ac- 
tive and passive perceptual reactions, 
which is later made central to the 
analysis of personality correlates, is 
vaguely metaphorical. Visual cues 
are likened to environmental pres- 
sures to which the individual may 
more or less surrender, whereas bodily 
sensations are compared to ‘“‘im- 
pulses’’ with which the individual 
may cope in a more or less active 


fashion. The distinction is difficult to 
follow. A cue is a cue and a response 
is a response, and there does not seem 
to be anything more passive about a 
response to a visual cue than about a 
response to an interoceptive or pro- 
prioceptive one. 

In any event, individuals show a 
considerable degree of  self-con- 
sistency from situation to situation 
in their relative adherence to visual 
and postural cues (or, if you will, 
their “‘passive’’ or “‘analytic’’ per- 
ceptual reactions). These modes of 
perceptual reaction also show system- 
atic genetic trends. For situations 
requiring judgments of the position 
of an object in the field, dependence 
on the visual field decreases as a 
function of age, although there is a 
reversal in this trend in late adoles- 
cence. Judgments of the field as a 
whole show no clear developmental 
trends, at least for the range of age 
groups investigated (8 to 17 years). 
It is suggested that the “primitive’’ 
passive acceptance of the field may 
be abandoned at an age earlier than 
8 years. Finally, females are more 
dependent on visual cues than males 
at all age levels, though the sex dif- 
ferences do not reach their full extent 
and significance until adulthood. 

So the stage is set for the investiga- 
tion of perception-personality rela- 
tionships. What personality traits 
distinguish the ‘‘active,"’ ‘“‘analytic” 
perceivers from the “‘passive,’’ “‘de- 
pendent” ones? To answer this ques- 
tion, a series of personality tests was 
administered to a sample of 52 male 
and 51 female college students. Many 
of the same procedures were used, in 
more or less modified form, with 
smaller samples of hospital patients 
and children of different ages. The 
personality procedures included an 
intensive interview, the Rorschach 
test, the Figure-Drawing test, the 
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Thematic Apperception test, and 
the Miniature-Toy Play Situation 
(for children). With the exception of 
the interview, all tests are of the pro- 
jective type. Why were these par- 
ticular tests chosen? One searches in 
vain for a theoretical rationale. They 
may have been selected because they 
form part of the conventional arma- 
mentarium of the clinical psycholo- 
gist, because they are more or less 
readily available, and because they 
would yield a large number of scores 
and hence a large number of correla- 
tion coefficients. We doubt, however, 
that the selection of tests was merely 
a matter of expediency. The appar- 
ently haphazard multiplication of 
projective techniques probably re- 
flects the conviction that if one but 
confronts a subject with a number of 
“unstructured” situations he will 
somehow reveal himself. Once a suf- 
ficiently large number of test ‘‘scores” 
has been obtained, the correlation co- 


efficients will somehow tell the story. 
Such shotgun correlationism is a far 
cry from good empiricism. The test 


instruments necessarily determine 
and limit the relationships which can 
be discovered and the research hy- 
potheses that can be entertained. 
These tests, which were developed 
primarily as clinical diagnostic in- 
struments, inevitably cast the inves- 
tigation of personality correlates of 
perception in a psychopathological 
mold. Consider some of the specific 
research hypotheses. Regarding 
those who rely primarily on visual 
rather than postural cues, the follow- 
ing speculations are advanced. ‘‘Such 
readiness to disregard the body cer- 
tainly is significant, also in personal- 
ity organization. It may be related, 
for example, to the phenomena of re- 
pression as a characteristic defense of 
the person (as in hysterical personali- 
ties), or to the lack of self-awareness 


found in ‘constricted’ personalities”’ 
(p. 177). As for those who made their 
judgments primarily in terms of 
postural cues, “ignoring a_ visual 
field, ignoring strong environmental 
stimulation, may be related to a 
variety of personality phenomena. 
For example, it may be associated 
with strong autistic trends, such as 
one finds in schizoid personalities, or 
with a high level of narcissistic atti- 
tude towards the self and a strong 
preoccupation with the body”’ (ibid.), 
Here they are, potestial patients one 
and all. Quite apart from the fact 
that these ‘‘hypotheses”’ are the prod- 
uct of metaphor and analogy, they 
clearly illustrate the all too frequent 
tendency to equate personality and 
psychopathology. To the pathetic 
fallacy which credits nature with 
human emotions we may now add 
the pathological fallacy which creates 
man in the image of the patient. 

It would be impossible to survey 
even in the sketchiest manner the 
multiplicity of scores which are ex- 
tracted from the various personality 
techniques (the Figure-Drawing test 
alone yields about 90 different meas- 
ures). Instead, we shall focus our dis- 
cussion on the general methods of 
analysis and interpretation. First of 
all, the authors show lack of concern 
about either the reliability or the 
validity of their instruments. Al- 
though many of the techniques seem 
to involve highly subjective and im- 
pressionistic methods of scoring, each 
test appears to have been scored by 
only one investigator (only in the 
case of the Rorschach test is there 
any mention of independent retests). 
As for validity, the authors seem 
never to question the absolute literal 
truth of every interpretation. In- 
deed, having interpreted a given score 
as an index of such and such a per- 
sonality trait, they soon proceed to 
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speak of the subject as in fact having 
this trait, as though it had been 
directly observed. The interpretation 
of the Figure-Drawing test is the 
most unrestrained in this respect al- 
though the others contribute their 
share. This reviewer knows very lit- 
tle about projective techniques but 
he is assured by experts that the 
problem of validity is indeed a serious 
one, and he was astounded to find it 
treated so lightly. The only claim for 
validity that is advanced is based on 
the results of the investigation itself. 
The high intercorrelations among the 
personality tests themselves are in- 
terpreted as evidence of their valid- 
ity (p. 451); ie., one personality 
test of unknown or doubtful validity 
is used as a validating criterion of 
another test of equally unknown and 
doubtful validity. 

Serious as the problems of relia- 
bility and validity are, the methods 
used for establishing the perception- 


personality relationships in the basic 
sample of normal adults raise even 
more formidable questions. The pro- 
tocols of those who gave extreme 
perceptual performances on two of 


the space-orientation tests were 
searched for differentiating scores. 
Only those scores which were found 
to discriminate between subjects ex- 
tremely high and extremely low in 
dependence on the visual field were 
retained for further analysis. It is 
these scores, and these scores only, 
which were used in the computation 
of the correlation coefficients. While 
such a procedure is in principle rea- 
sonable for purposes of exploration, no 
conclusions can be based on such 
forced correlations unless the meas- 
ures are validated on appropriate in- 
dependent samples. In the absence of 
such validation, the procedure would 
thoroughly beg the question which 
was to be answered by the research. 


The authors seem to be aware of this 
difficulty and offer the following 
validations of their personality meas- 
ures: (a) The personality measures 
were selected on the basis of their dif- 
ferentiating value for two of the per- 
ceptual tests but correlated signifi- 
cantly with two other tests as well. 
This is a small source of comfort 
since the various perceptual tests 
correlate highly with each other. (6) 
The measures developed on the nor- 
mal sample were later applied to 
hospital patients and to some extent 
to children of different ages, and these 
later applications also yielded sig- 
nificant relationships. Close exami- 
nation shows that this validation of 
the personality measures leaves a 
great deal to be desired. First of all, 
samples drawn from populations en- 
tirely different from the standardiz- 
ing groups hardly seem appropriate 
for purposes of validation. Second, 
we find that the measures do in 
fact discriminate much less among 
the members of these subsequent 
samples than they do among the 
members of the original group. This 
is as it should be since the original 
correlations capitalized on chance 
to the fullest possible extent. The 
authors do not seem to be willing to 
recognize the fact that they took ad- 
vantage of chance in using only 
scores that were found to be dis- 
criminating. Whenever the valida- 
tion fails, it is explained away by the 
fact that the standardizing group and 
the validating group were, after all, 
drawn from different populations! It 
appears that they want to eat their 
statistical cake and have it, too. 
Finally, it is rather disturbing that 
actual correlation coefficients are 
presented for the normal group only, 
i.e., the coefficients which must be 
assumed to be inflated by chance. No 
correlations are presented for either 
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the hospital group or the groups of 
children. For these groups, only 
totals or averages for high, medium, 
and low perceptual performers are 
given. This omission is ascribed to the 
small size of the samples (children), 
and to the fact that the validation 
groups are not “random” (hospital 
patients). The reasoning whereby 
the nonrandomness of the group pre- 
cludes the computation of correla- 
tion coefficients remains obscure. If 
the correlations are systematically 
too high or too low because of the 
nature of the distributions involved, 
it would be appropriate to present 
the values of the coefficients and to 
discuss them in relation to those ob- 
tained with the original group. 
Clearly, the major burden of proof of 
the perception-personality relation- 
ships is carried by the basic normal 
group, yet the correlations obtained 
‘or these groups are seriously vitiated 
by the preselection of differentiating 
scores. 

The final conclusions concerning 
the relationship between perceptual 
performance and personality are 
these. Those who depend primarily 
on the visual field ‘‘tend to be char- 
acterized by passivity in dealing with 
the environment; by unfamiliarity 
with and fear of their own impulses, 
together with poor control over them; 
by lack of self-esteem; and by the 
possession of a relatively primitive 
undifferentiated body image’ (p. 
469). So-called analytic perceivers 
are characterized by the opposites of 
these traits. And women show a high 
degree of dependence on the visual 
field because both their anatomy and 
the culture in which they live pre- 
scribe a passive dependent role for 
them! As we have tried to show, the 
conclusions are not warranted by the 
evilence. However, assuming the va- 
lidity of these conclusions for the sake 


of the argument, such relationships 
may be of interest to the clinician 
but do they, as the authors proclaim, 
afford the experimental psycholo- 
gist “a more fruitful and natural 
setting for his research?” They tell 
us little, if anything, about the mech- 
anisms of perception, and it is pre- 
cisely here that the theoretical and 
empirical problems of the psychology 
of perception are most urgent. We do 
not agree with the authors that this 
type of research will serve to narrow 
the gap between the experimentalist 
and the clinician; indeed, it bids fair 
to widen the separation. Let us heed 
Gresham's law in psychology as well 
as in economics lest. the doubtful 
currency of the Rorschach, TAT, and 
Figure-Drawing test drive out the 
good currency of perceptual experi- 
mentation. 
LEO PosTMAN 
University of California 


CuRIsSTIE, RIcHARD, & JAHODA, 
Marie. (Eds.) Studies in the scope 
and method of ‘‘The Authoritarian 
Personality."’ Glencoe: The Free 
Press, 1954. Pp. 279. $4.50. 


The Authoritarian Personality, pub- 
lished in 1950, presented a half million 
word report on an ambitious program 
of social and personality research. 
Begun as a study of anti-Semitism 
and prejudice, the work spread to 
related personality origins, life atti- 
tudes, and political beliefs. The scope 
and methodology of the project un- 
folded gradually, sometimes clumsily, 
and was often forced on the workers 
by new insights and the importance 
of the material uncovered. The con- 
tributions to method are not neat, 
perfectionist models of old methods: 
they are rather new applications, 
modifications, and combinations of 
techniques. 

Now, four years after the original 
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publication, the present book on 
Scope and Method' attempts to eval- 
uate the original work, both in terms 
of significance and techniques. The 
seven authors review, weigh, and 
criticize every detail, point out de- 
fects and omissions, praise the over- 
all project. They leave the rigor- 
trained reader, and themselves, a 
little puzzled as to how a study with 
so many defects can contribute so 
much. Their answer seems to be 
“scope”’ and the pioneering signifi- 
cance of content and methodology. 
This thesis, not specifically stated in 
the book but running through all 
the contributions, might be formu- 
lated as follows: 

From the first, the study of the 
authoritarian personality was a cru- 
sade. The sponsors, research workers, 
and authors had strong convictions— 
prejudices if you will. In planning, in 
executing, in presenting their work, 
they were determined to say “‘some- 


thing of some importance about an 


important subject.”” They put the 
significance of the problem above the 
limitations of tried-and-true method- 
ology. They were undoubtedly well 
equipped with unconscious motiva- 
tions, with peculiarly unobjective 
group-bound viewpoints, and with 
the overdetermined honesty of men- 
of-good-will and of scientific training. 
They took the plunge, and produced 
a work of the utmost importance. 
The dilemma is not a new one, nor 
is it faced today by social scientists 
alone: Shall we take a bold stand 
based on our convictions, or shall we 
be objective? And can we do both? 
No better model for conduct in the 


' The book reviewed here will be referred to 
throughout as Scope and Method. The com- 
plete title is given at the beginning of the re- 
view. The abbreviated title serves to dis- 
tinguish the present volume from the original 
work entitled The Authoritarian Personality 


face of this dilemma could be offered 
than that set by Nathan Glazer? in 
his review of Scope and Method. He 
writes for Commentary, published by 
the American Jewish Committee: 
there is no doubt where he stands on 
anti-Semitism; he is part of the cru- 
sade. Yet he keeps the faith; puts 
forth persistent effort to evaluate 
the original research and the later 
volume objectively, carefully, honest- 
ly. The original research workers 
were simply unable to control their 
own bias, to keep it out of their termi- 
nology, methods, and results. The 
authors of Scope and Method are 
troubled, laudatory, critical, clear on 
specifics, confused in general. They 
are aware of the dilemma but often 
turn to details of method, defensive 
praise of the righteous cause, or busy 
enthusiasm for new developments. 
The whole history of this research, 
which is a bold attempt to under- 
stand prejudice or bias, is a struggle 
with our own biases. The struggle 
promises to be successful and should 
result eventually in a high degree of 
honest and scientific understanding. 
The often harsh criticisms of Scope 
and Method are the sincerest tribute 
to the vision, faith, and integrity of 
the original sponsors and research 
workers. They undertook to study 
bias with bias, but they set forth their 
plan, their methods, their evidence 
in sufficient detail so that others can 
evaluate, correct, and reformulate. 
The authors of the present book 
consider the scope, method, and “‘im- 
pact” of the original research. In 
scope, the studies recognized the 
importance of teasing out explicit 
relations between opinions and social 
discrimination on the one hand, and 
complex personality and cultural 
New light on “The 
Commentary, 


? GLAZER, NATHAN. 
Authoritarian Personality.” 
March 1954, 289-297. 
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factors on the other. In method, they 
combined theoretical deductions, in- 
tensive studies of individuals, and 
large-scale statistical methods. In 
impact, they have stimulated a tre- 
mendous output of additional re- 
search, including surprising insights 
into related fields of belief and social 
roles. 

Scope and Method indicates that 
the original research fails to demon- 
strate the main thesis. The person- 
ality of the authoritarian emerged 
as rigid, uncritically submissive, 
prejudiced against outgroups and the 
unconventional, preoccupied with 
problems of status and power, dis- 
posed to believe in fantastic influ- 
ences going on in the world, over- 
concerned with sexual transgressions, 
etc. These traits presumably stem 
from psychoanalytically interpreted 
experiences of early childhood and 
parental handling. The critique of 
method leads to the conclusion that 
“almost nothing stands up of the 
original study.”” The main hypothe- 
sis, never clearly stated, is not con- 
firmed. Nevertheless an impressive 
body of objectively suggested (!) or 
subjectively logical relationships is 
set forth. For the most part they 
do not explain so much as demand 
explanation, but they are an impor- 
tant beginning. It should be added 
that they could easily be misused to 
foster prejudice and group hatred. 

The most fascinating contribution 
to method is the use of intensive 
studies of individuals, the attempt 
to employ such studies with some 
scientific sophistication, and _ the 
awareness that they may potentially 
play a role in “rigorous” evidence 
when suitably fitted in with other 
approaches. The evaluation of this 
phase of methodology and the cri- 
tique of opinion and attitude meas- 
urement are outstanding in Scope 


and Method. Yet, one must agree 
with Hyman and Sheatsley’s con- 
clusion that methodology, as ordi- 
narily defined, is the earlier study's 
“weakest point.” This reviewer 
would sum up the over-all defect as 
related to the controversial nature of 
the problem and the original bias. 
Hyman and Sheatsley say: ‘‘the mis- 
takes and limitations—no one of 
them perhaps crucial—uniformly 
operate in favor of the authors’ as- 
sumptions.” 

The “impact” value of the study is 
presented in Scope and Method as the 
third major contribution or consider- 
ation. A passing protest must be 
offered against the current naive as- 
sumption that anything is good if it 
stimulates further research: the wide- 
spread interest in prejudice and 
opinion systems might as well be 
attributed to Hitler, with the corol- 
lary that Hitlerism was therefore 
good. Research that has a great 
impact or heuristic value should have 
a corresponding degree of nobility 
and integrity. In this case, these 
latter qualifications are more than 
fulfilled. 

While the current volume on Scope 
and Method builds on the earlier 
work, it may prove to be the more 
useful of the two as a reference and 
source of stimulation for future stud- 
ies. 

Roy M. HAMLIN 

Western Psychiatric Institute, 

University of Pittsburgh 


GRAY, J. STANLEY. (Ed.) Psychology 


applied to human affairs. (2nd 
Ed.) New York: McGraw-Hill, 
1954. Pp. vii+581. $6.00. 


Like its predecessor published in 
1946, this revised edition is contrib- 
uted to by specialists in various 
fields. While chapter titles and se- 
quence have changed, the only new 
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topical area covered is job evaluation. 
Treatment of this is coupled with 
merit rating in Chapter 15, Psychol- 
ogy in Industry—Wage Determina- 
tion. The literature coverage seems 
to extend through 1950, judging 
from end-of-chapter references. This 
edition, too, is designed as an under- 
graduate textbook for the second 
semester course where the curriculum 
provides a first semester general psy- 
chology offering followed by a course 
in “applied.” 

The book has an admirable con- 
tinuity and lucidity of style not- 
withstanding its multiple author- 
ship. Further, pertinent cross-refer- 
encing between chapters contributes 
to the cohesiveness of the subject 
matter presented. In connection 
with the latter, the reviewer noted 
only one instance of a topic being 
treated repetitively—tests as screen- 
ing tools are discussed in Chapter 
13, Psychology in Industry—Em- 
ployment Methods, and again, de 


novo, in Chapter 16, Psychology in 
Marketing and Advertising, applied 
to sales selection. 

A molar reaction may be of inter- 


est. Haven't we grown beyond the 
single course in applied psychology? 
What does the undergraduate gain 
from the cursory, perhaps even per- 
functory, ramble which furnishes him 
nibbles of personal efficiency, psy- 
chology of learning, clinical applica- 
tions, work and efficiency, selling 
and advertising, etc.? If such a 
course still has a place in our curricu- 
lum Gray's book will structure the 
trip and furnish a brief view of each 
station en route. 

The specific criticisms which fol- 
low are acknowledgedly selective. In 
the chapter on vocational guidance 
the term “aptitude’’ is used two 
ways—as an embrasive term for all 
personal qualities needed for voca- 


tional success and in its more limited 
meaning as potential to learn. This 
may cause confusion for the naive 
reader. In the same chapter the au- 
thor states two future occupational 
trends as (a) more and more govern- 
ment function with the implication 
of larger government payrolls, and 
(b) “‘.. . the slowly developing trend 
toward the more efficient distribu- 
tion of goods (through various forms 
of cooperative agencies) and the elim- 
ination of the parasitical and func- 
tionless middleman...’ (italics are 
the reviewer's). The former seems 
justified if our perspective is limited 
to the period 1932-52, but there is a 
hope shared by many that this in- 
crease in government size and scope 
is now being curtailed. The second 
trend seems out of line with market- 
ing data and, at best, a poor value 
judgment with respect to American 
marketing methods. 

The frame of reference for person- 
nel selection is the job to be done. 
Particularly in the case of nonsuper- 
visory openings, the applicant must 
fit certain prescribed manpower speci- 
fications. Because job evaluation is 
not presented until a later chapter, 
the topic of employment methods is 
treated without the necessary tie to 
the job. In the same chapter the 
concept of the selection ratio in its 
relation to validity is omitted from 
the treatment; instead disscussion is 
in terms of Hull’s E, a statistic of 
questionable utility in the field of 
employee screening and selection. 

In the chapter on training and 
morale, the former topic is treated 
more as skill acquisition than as 
education of the whole individual. 
Further, no point is made of day-to- 
day supervision as the major setting 
for training. Even the management 
literature uses the term development! 
In discussing morale, no mention is 
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made of the sociometric concept of 
morale as group cohesiveness or of 
morale as goal-sharing activity. It 
is unfortunate that job evaluation 
and merit rating are treated under 
wage determination. It is true that 
this is one administrative use of these 
personnel functions. However, job 
analysis and resulting job descrip- 
tions which precede the former serve 
also as a frame of reference for selec- 
tion, training, and assessment of 
performance. The assessment phase 
of the latter has, as a primary func- 
tion in day-to-day supervision, the 
development of the individual em- 
ployee 

The high-light chapter is the one 
prepared by Stagner, Put tic Opinion 
and Propaganda. His 35 pages are 
chock-full of current information in 
this area and he draws a cohesive 
picture for the reader, documenting 
each subtopic with pertinent research 
findings. 

In summary, the book is recom- 
mended for the instructor in under- 
graduate applied psychology classes 
who is prepared to supplement it 
with a structure of underlying theo- 
ries and principles. Indeed, the latter 
is what is recommended to be done 
by Gray, himself. 

W. J. E. Crissy 

Queens College 


CARROLL, JOHN 
language. Cambridge: Harvard 
Univer. Press, 1953. Pp. xi+289. 
$4.75. 

Various academic disciplines, in- 
cluding psychology, are represented 
among those who study language. 
This book attempts to show how 
linguistic science studies language 
and how it is related to the other 
disciplines in this study. It is an 
expanded version of a report by the 
author to the Carnegie Corporation 


B. The study of 


of New York entitled ‘“‘A Survey of 
Linguistics and Related Disciplines.” 

Psychologists will welcome the 
concise description in Chapter 2 of 
The Science of Linguistics. Although 
linguistic findings are not presented 
in any detail, Carroll tells us about 
the several aspects of linguistic 
science: the methods used, the nature 
of the tasks undertaken, and the dif- 
ferent kinds of theoretical models 
sought. 

Four chapters, extremely uneven in 
both style and content, show the rela- 
tions between linguistics and psychol- 
ogy, social sciences, philosophy, and 
education, respectively. By restrict- 
ing linguists to the study of language 
systems as systems, Carroll points 
out that a complete analysis of lan- 
guage behavior must involve extra- 
linguistic studies. The psychology of 
language, in particular, has to do 
with the problems of meaning, lan- 
guage learning, and the intentive and 
perceptive aspects. In an apparent 
attempt to preserve academic bar- 
riers, Carroll suggests that, in addi- 
tion to the important areas to be 
studied by linguistics on the one hand 
and by psychology on the other, an 
intermediate area, which he calls 
“‘psycholinguistics,”’ be recognized. 
This reviewer feels that these area 
specifications seem too formal and 
fail to recognize the necessity that 
students of language be responsible 
for the methods and concepts of 
several disciplines. The requirement 
that a new field of endeavor be set up 
and labeled deprecates the compe- 
tence of serious scholars and scientists 
and gives assent to the mediocrity 
that is sometimes misnamed speciali- 
zation. 

The relation between linguistics 
and sociology and anthropology is 
covered quite briefly and super- 
ficially, the most interesting part 
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being a consideration of the problems 
attending the creation of an inter- 
national auxiliary language. The 
relation between the linguistic and 
philosophical approaches to language 
is also treated briefly; while the 
philosopher considers all possible 
syntactic relations in any sign- 
system, the linguist restricts himself 
to natural languages. 

Ranking with the chapter on lin- 
guistics as a science, so far as informa- 
tion is concerned, is the one on lin- 
guistics and education. Here Carroll 
considers several applications includ- 
ing the teaching of the language arts, 
speech education, general semantics 
in education, and the teaching of a 
second language. The author seems 
most comfortable with the material 
here, presenting many interesting 
hypotheses and conclusions which 
are much better documented than 
those in other sections of the book. 
It also appears that the excellence of 
this chapter is partly due to the ab- 
sence of concern with the role of dif- 
ferent disciplines. The material is 
presented in a well-organized fashion, 
without any strained concern for the 
academic backgrounds of the quoted 
authors. 

There is also an elementary review 
of information theory and of the in- 
fluence of communications engineer- 
ing on the study of speech. One re- 
sult of a somewhat narrow view of 
this area is that the chief contribution 
of communications engineering ap- 
pears to be the provision of analyzing 
and synthesizing machines which 
have opened new areas in the labora- 
tory study of language. 

The book satisfactorily gives the 
reader a sense of the complexity of 
language study and of the several 
fields contributing thereto. The 
author’s criteria for what is to be 
termed scientific study appear to be 


inconsistently applied. He sometimes 
objects, for example, to the use of 
“experimental phonetics’ because 
some of the studies under this head- 
ing have not been thoroughly experi- 
mental. On the other hand, he im- 
plies that linguistics is further ad- 
vanced as a science than psychology 
because it was already scientific by 
the time Wundt opened his labora- 
tory. This curious conclusion fails to 
distinguish between the kind of ex- 
perimental science that psychology 
started to be around 1879 and the 
observational kind of science, like 
astronomy, that linguistics continues 
to be. 

It is certainly becoming impossible 
for any one student of language to 
know all of the fields that contribute 
to its study, and thus Carroll's book 
will be useful for one who wishes at 
least to become aware of the many 
approaches that have been made. It 
would be better perhaps to refer to 
this as a guidlebook rather than a ba- 
sic reference, because with its synop- 
tic organization and extensive bibli- 
ography, the student of language in 
any discipline can find suggested to 
him those areas that seem most inter- 
esting or appropriate to his own study. 

IRA J. HiRsH 

Central Institute for the Deaf 


ARNOLD, Macpa B., & Gasson, J. A. 
(Eds.) The human person: an ap- 
proach to an integral theory of per- 
sonality. New York: Ronald, 1954. 
Pp. x+593. $6.00. 


This book was written by and for 
psychologists of the Roman Catholic 
faith. It is based on a series of papers 
presented in 1951 at a workshop in 
personality at Barat College. These 
papers (by a number of authors, in- 
cluding the editors) were expanded 
by Arnold and Gasson by means of 
integrating comments that appeared 
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in many cases as addenda to particu- 
lar chapters and in other cases as 
separate chapters. In view of this 
editorial integration, and in view of 
the authors’ shared belief in Catholic 
doctrines, it seems justifiable to treat 
the book as a whole rather than to 
evaluate the contributions of the 
several authors. 

The reader is at once impressed by 
the fact that the authors are inclined 
to subordinate psychological princi- 
ples to the articles of their religious 
faith—that they attempt to remodel 
psychology in the image of an ideol- 
ogy. Although it seems to the re- 
viewer, as it probably seems to the 
majority of psychologists, that man’s 
behavior should be viewed as a 
system of naturalistic events occur- 
ring within a naturalistic context, the 
authors declare that only in experi- 
mental physics and chemistry should 
the scientist “investigate what is, 
without reference to its ultimate 


And, 


nature, origin, or destiny.” 
foreseeably, man’s nature, origin, and 
destiny are conceived in terms of a 
strictly orthodox Catholic frame of 


reference. Man differs from animals 
according to the authors, not quanti- 
tatively but qualitatively; man is 
said to be ‘‘a creature who can know 
himself and his Creator, toward 
whom he tends as the goal of his 
specifically human capacities.” 

The authors reject the principle of 
determinism as applied to human 
behavior and substitute for it the 
principle of self-determination. If 
this latter principle were meant to 
imply nothing other than that be- 
havior determinants function within, 
rather than outside, the individual— 
that, for example, a person’s behavior 
is determined by his perception (in- 
cluding its conscious and unconscious 
motivational components)—it would 
not be difficult to agree with the 


authors, since, by this interpretation, 
they would be subscribing to the 
principle of determinism. However, 
they state that “we... settle the 
issue between conflicting motives by 
the active interposition of the ego.”’ 
This ego is conceived to function in- 
dependently of, and to transcend the 
influence of, motivation. Thus, the 
authors deny determinism and sub- 
scribe to “free choice”’ or “free will’’ 
—a conception that cannot be inte- 
grated with an organismic view of 
behavior and that denies the poten- 
tial predictability of human actions. 
This seems to be an unfortunate sac- 
rifice when, in fact, it is possible to 
maintain determinism, without con- 
tradicting self-determination, by pos- 
tulating that religious and ethical 
values are functional components of 
the motivational pattern. 

The aim of personality growth, and 
hence of psychotherapy, according to 
the authors, is that of closing the gap 
between the “‘self-ideal as it actually 
is’ and the “‘self-ideal as it ought to 
be.”” They point out that therapy 
has characteristically involved cor- 
recting the self-concept (closing the 
gap between “what I am and what I 
think I am’’) and adjusting the self- 
ideal to the self-concept (reducing the 
level of aspiration to the person's 
corrected perception of his potential 
achievement). However, “if there is 
a gap between the self-ideal as it ac- 
tually is and the self-ideal as it ought 
to be, then there will be a disturb- 
ance” in the person’s adjustment. 
This, of course, is a valuable point to 
emphasize, provided that the “‘self- 
ideal as it ought to be”’ is defined in 
terms of the prevailing social values, 
for this would merely imply that the 
person will experience frustration in 
his social relations if his values are 
too unconventional. But the authors 
define the ‘‘self-ideal as it ought to 
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be” in terms of Christ's perfection— 
an ideal that is admitted to be prac- 
tically unattainable. Such a thera- 
peutic goal, besides being unrealis- 
tically high, would be applicable only 
within a Christian culture, and in- 
deed only within a much more dedi- 
cated Christian culture than is likely 
to occur in the foreseeable future. 

The authors demonstrate a high 
degree of sophistication in dealing 
with psychological concepts deriving 
from varied schools of thought, if 
only for the purpose of refuting many 
of the concepts that are widely ac- 
cepted among present day psycholo- 
gists. To the Catholic reader, per- 
haps, this book will seem to offer 
significant contributions. The non- 
Catholic reader, however, is likely to 
find it disappointing, since it pro- 
vides, instead of an integrated theory 
of personality, little more than a dis- 
cussion of psychological principles in 
terms of their agreement or disagree- 
ment with Catholic doctrines. 

Bert R. SAPPENFIELD 
Montana Staie University 


Harms, Ernest. Essentials of abnor- 
mal child psychology. New York: 
Julian Press, 1953. Pp. xiii+265. 
$5.00. 


This is one of the most unorthodox 
books in the field of psychology the 
reviewer has seen in a long time. 
From the title one would expect a 
systematic treatment of the psycho- 
pathology of childhood; what one 
actually finds is a series of independ- 
ent articles. Much of the material 
presented is intriguing because of its 
novelty. 

Bemoaning the fact that there is no 
clear concept of normality or abnor- 
mality in psychology today, Harms 
promises early in his volume to rec- 
tify this situation: ‘But I do feel that 
I have attempted an advance in a 
direction which seems more promis- 


ing of a proper solution than any pre- 
viously tried.’’ His first major con- 
tribution is the concept of paternus 
and materna as a substitute for 
Freud's infant sexuality and Oedipus. 
Calling this “a new scientific for- 
mula,” he defines paternus and ma- 
terna: ‘‘They represent the aboriginal 
relationship of a human child to his 
parents."’ Much is made of this con- 
cept in a very involved chapter; 


freely translated, Harms says that 
relationships are 


parent-child im- 
portant. 

Without any transition, the author 
plunges into an involved discussion 
of childhood schizophrenia and hys- 
teria, the burden of which is that 
these conditions are frequently mis- 
diagnosed. Both, he says, are caused 
by “basic ego malfunctioning,” but 
in the former, therapy must be aimed 
directly at ‘reconstruction of the ego 
functioning,”’ while in childhood hys- 
teria the same objective must be 
sought indirectly. How this is done 
is not stated but we are urged, when 
in doubt, to err in the direction of 
treating hysteria, since no harm can 
result from this. 

Another ‘“‘new concept” presented 
by Harms is that of ego inflation and 
ego deflation. The latter is hardly 
distinguishable from feelings of in- 
adequacy, and the former seems to 
resemble compensations for such 
feelings, but it requires 24 pages to 
bring this out. It takes another 52 
pages to point out that clinicians 
tend to underestimate the existence 
of manic-depressive psychoses in 
childhood and adolescence. 

“The most important contribution 
this volume makes to the field,” says 
Harms, is the chapter on the Mignon 
Neurosis. “The Mignon Neurosis 
represents a one-sided developmental 
pattern in which the child wants to 
grow up faster than he can. . . . Psy- 
chopathologically, the Mignon Neu- 
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rosis may be designated as a paranoid 
pattern related to a schizoid back- 
ground which may sometimes ac- 
tually develop into a case of juvenile 
schizophrenia. Some present-day 
psychiatrists may wish to label it as 
a behavior disorder, others may ac- 
cept our classification of it as a de- 
velopmental neurosis.” Apparently 
we have a choice. 

The work is divided into two parts, 
and Part II deals with Diagnostic 
and Therapeutic Procedures. After 
detailed discussion of child art as 
diagnostic means—treated as though 
no one else had ever used this medium 
properly—we find presentations of 
original therapeutic approaches. 
Transitional Therapy is a mystical 
approach to “the symbolism of sub- 
conscious collectivism and mytho- 
logical consciousness,"’ by use of art 
production. Autogenic Therapy 
seems to be a combination of self- 
analysis and relaxation. Substitution 
Therapy is designed for orphans only. 
Nor is this all. We find chapters on 
the incorrect diagnosis of feeble- 
mindedness, on the education of the 
mentally retarded, and on brief child 
guidance treatment. 

The book is original, provocative, 
and provoking. It left this reader 
dazed. 

Morris KRUGMAN 

Board of Education, 

New York, New York 


SEGAL, J. Le mécanisme de la vision 


des couleurs. Paris: G. Doin & 
Company, 1953. Pp. 351. 3,000 fr. 


Visual specialists for the most part 
still subscribe to the classical tri- 
chromatic conception with regard to 
the color mechanism, even though 
recent evidence concerning the na- 
ture of the retina and the involved 
neural functions seemingly has failed 
to narrow the gap that long has 
existed between fact and theory in 


vision. Attempting to evaluate more 
closely the nature of the actual color 
mechanism that is indicated by the 
growing body of relevant informa- 
tion, several investigators, working 
independently, have recently reached 
a negative conclusion as to the pros- 
pect of confirming the prevailing 
color theory and have proceeded to 
set forth views of their own. After 
many years of intensive study and 
experimentation, Segal has reached 
the same conclusion, and in the pres- 
ent work submits his own construc- 
tive formulation along with the argu- 
ments that are associated with its 
development. 

Although the new theory is cast in 
the familiar trichromatic mold, Segal 
breaks sharply from conventional 
thinking in regard to the details of 
visual physiology. Divesting himself 
of prevailing conventions, he envis- 
ages the mechanism underlying color 
in terms of essentially new relation- 
ships. The logical reasoning on which 
the new prospect of the mechanics 
of color is based is worked out in con- 
siderable detail and in relation to 
much of the relevant knowledge that 
steadily has been accumulating. 

Three fundamental colors are 
needed according to Segal. More 
than three would involve too much 
overlapping to satisfy the facts of 
saturation. He chooses as primaries 
red, blue-green, and violet, and desig- 
nates yellow and blue equilibrium or 
intermediate colors. The primaries 
at the two extremes are both extra- 
spectral, i.e., purplish, since extreme 
spectral red tends toward yellow and 
violet toward blue as intensity is in- 
creased. 

Examining the absorptive proper- 
ties of some seven hundred different 
colorants, Segal finds a carotenoid- 
protein which interacts with oxygen 
that seems to fill the absorptive re- 
quirements of the color mechanism. 
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This is rhodopsin, a derivative of 
vitamin A, which appears in different 
states to meet the needs of selective 
absorption for differences in wave 
length. In the liquid state, rhodopsin 
has the absorption characteristics 
needed for the intermediate spectral 
wave lengths (blue-green), and in the 
first step of its photolysis under the 
action of light known as transient 
orange, the substance which suits the 
need for short wave absorption is 
found. The identity of the yellow 
macular pigment and transient or- 
ange is confirmed. Transient orange 
is said to be much more stable than 
Lythgoe, among others, has asserted. 
No substance in solution is highly 
absorptive enough for long wave 
length reception, but a microcrystal- 
line form of the substance which ab- 
sorbs long wave lengths in sufficient 
concentration is found in the pigment 
epithelium. Segal’s is not a dual 


system, achromatic vision being ac- 
counted for through absorption by 


rhodopsin at the lower range of in- 
tensities. Of the 120 illustrations and 
graphs contained in the book, many 
are designed to show how the absorp- 
tive properties of the different forms 
of the substance are adequate to both 
chromatic and achromatic needs. 
Early in the work, in a 38-page 
second chapter, Segal makes a devas- 
tating attack upon the dual theory of 
vision. The following are the chief 
arguments: Histological observation 
has shown that the retinal elements 
in the foveal center are rodlike rather 
than conelike. Rhodopsin is found in 
the external segment of the cones as 
well as in the rods, and the photo- 
sensitive substances in the rods and 
cones are found to have the same 
concentration, the same kinetics, and 
both derive from vitamin A. Adapta- 
tion, one of the bulwarks of the dual 
theory, is ascribed to differences in 
the rate of recovery of sensitivity fol- 


lowing exposure to light above and 
below the critical light intensity of 
6,000 trolands. Following exposure 
to intense light, the retinal elements 
having individual nerve connections 
are involved in activity and exhibit 
rapid recovery; after mild exposure, 
neural elements in which a number 
converge upon a given nerve fiber are 
active, manifesting a slow rate of re- 
covery. The Purkinje phenomenon 
is explained by means of a difference 
in retinal function (a) when rhodop- 
sin has been accumulating under very 
low levels of light exposure, and (6) 
when rhodopsin has been converted 
to transient orange under the in- 
fluence of intense light. 

A chapter of 34 pages is devoted to 
a consideration of the way in which 
the retinal elements are projected 
upon subcortical and cortical centers. 
The argument here necessarily is 
more schematic, although evidence is 
brought forward to indicate that the 
three absorptive pigments are indi- 
vidually represented at least as far as 
the lateral geniculate bodies. Experi- 
mental evidence varied in nature is 
interpreted as signif ying that phenom- 
ena of contrast and acuity do not 
depend on the red and violet com- 
ponents of the system, but wholly on 
the blue-green. Even if the argument 
regarding these aspects of the color 
mechanism is less terse and more in- 
complete, Segal at least has not 
failed to recognize that over-all con- 
siderations of the color mechanism 
include postretinal physiology. 

The longest chapter in the book is 
concerned with the various anomalies 
of color vision. It embraces 72 pages 
—more than 20 per cent of the entire 
book. In terms of different degrees of 
deficiency or superabundance in one 
or more of the receptive components, 
explanations are provided for the 
several well-known varieties and the 
different degrees of anomalous color 
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vision. These are shown to be con- 
sistent with an abundance of experi- 
mental and clinical data, including 
the Rayleigh equations. New infor- 
mation on congenital and acquired 
anomalies, obtained in collaboration 
with Or. Dubois-Poulsen of Paris, 
is submitted and a suggestion made 
that a joint work is in preparation. 

The bibliography of 130 items in- 
cludes no reference to secondary 
sources or to general works. Only five 
or six of the references go back as far 
as 1920. The newer literature involv- 
ing the use of electrophysiological 
techniques receives particular atten- 
tion. While a few important names 
are not to be found, the bibliography 
includes the more important ones. 

In prospecting with regard to the 
nature of the color mechanism, Segal 
is motivated in an entirely objective 
way. Realizing that the long existing 
gap between fact and theory in vision 
has not been becoming more narrow, 
he has made a critical evaluation of 
an extensive literature in an effort to 
take an initial step toward a more 
promising envisagement of the prob- 
lem. In designating his formulation 
a working hypothesis, he uses a de- 
vice to safeguard himself with regard 
to modifications that may be found 
necessary in the future. In a thor- 
oughly scholarly manner, he has pre- 
sented what amounts to a simplifica- 
tion of the physiological mechanics of 
color vision. Whether the new pros- 
pect is confirmed in its entirety, or in 
part only, Segal has made a contribu- 
tion to the psychophysiology of color 
that is bound to serve as the basis for 
further critical studies designed to 
come more closely to grips with the 
true nature of the color mechanism. 

MICHAEL J. ZIGLER 

Wellesley College 


PERRY, RALPH BARTON. Realms of 
value. Cambridge: Harvard Uni- 


ver. Press, 1954. 


$7.50. 


In his latest major publication 
Professor Perry offers an extension of 
his earlier General Theory of Value 
(1926), which has remained the 
standard work in its field. The scope 
of the present work is suggested by 
the subtitle, “A critique of human 
civilization."’ Its significance, from 
the psychologist’s point of view, lies 
more in the pointing up of a number 
of critical problems that need to be 
met, ultimately, than in any specific 
contributions to content or method. 

In his systematic orientation to 
psychology the author has been much 
influenced by several of his Harvard 
colleagues, particularly Gordon All- 
port and Henry Murray. Frequent 
quotations from and allusions to 
these and similarly oricnted authori- 
ties (e.g., E. C. Tolman) indicate this 
bias. The resulting “holistic” and 
“dynamic” flavor of the work will 
probably insure a considerably more 
sympathetic reception of it by so- 
cially and practically oriented psy- 
chologists than by those holding to 
other biases. 

The importance of psychology in 
Perry's thinking is attested by the 
prominence of recourse to psycho- 
logical principles and problems in the 
first few chapters. In these he en- 
deavors to develop a fundamental 
definition of ‘‘value”’ for exposition in 
a wide variety of fields of knowledge 
throughout the remainder of the 
book. His main theme is that value 
is based upon interest, which in turn 
is based upon expectations of the 
outcomes of actions. Perry's failure 
to come up with any satisfactory 
answers to the problems of motiva- 
tion, with which he is squarely faced 
at this point, is hardly an occasion 
for surprise. In this respect he is in a 
large and growing company, by no 
means restricted to philosophers. 





94 BOOK REVIEWS 


Nevertheless, he is to be credited 
with at least having begun to ask 
some of the important questions, 
rather than to have sidestepped them, 
as has been customary in similar 
philosophical treatises. 

A major failing in Perry's treat- 
ment of motives, as | see it, concerns 
his emphasis (or perhaps overempha- 
sis) upon the role of cognition as a 
determiner of interest. Although he 
does upon occasion mention the role 
of habitual and unconscious factors, 
these are not elaborated, and they are 
considered to operate in an entirely 
subsidiary manner. This is a reflec- 
tion of his general bias, mentioned 
above. 

Perry is to be commended for set- 
ting up and holding to a carefully 
prepared set of definitions. More- 
over, the book is written in a 
straightforward and easy-to-under- 
stand style. A generous sprinkling of 
neatly turned phrases helps to main- 
tain reader interest. 


Although I do not see how this 
book can have any very direct effect 
upon the development of psycho- 
logical science, it is nonetheless an 


important one for contemporary 
psychologists to consider. It repre- 
sents the mature judgments of a 
scholar who has spent a lifetime of 
reflection on issues of great import 
to all of us, scientist and nonscientist. 
It presents a great many problems 
which are at once disturbing and 
challenging, and which are today of 
particular concern to the scientist- 
citizen. Regardless of the degree of 
success of its suggested solutions in 
any given field, it is to be regarded, 
in my opinion, as a first-rate achieve- 
ment of a scholar who may still be 
called a philosopher in the original 
and broadest meaning of the term. 

Metvin H. Marx 

University of Missouri 


Brown, Jupson S., ef ail. Current 
theory and research in motivation: a 
symposium. Linco:n, Nebraska: 
Univer. of Nebraska Press, 1953. 
Pp. v+193. $2.00. 

This small volume is a collection 
of six prepared papers and the com- 
ments to them generated among the 
participants of a two-part symposium 
on motivation held early in 1953 at 
the University of Nebraska.  Al- 
though the papers tend to be uneven 
in quality and variable in the degree 
to which theoretical formulations and 
experimental data are emphasized, 
they are by and large stimulating, 
informative, and provocative. The 
comments on these papers add con- 
siderably to their value and give the 
reader a feeling of being witness to a 
scholarly free-for-all, in which each 
participant seems to wield his cudgels 
with an open and objective mind, 
firmly convinced of the correctness of 
his own position. In this latter re- 
spect, the comparative psychologists 
-——J. S. Brown and H. F. Harlow— 
appear to be somewhat argumenta- 
tive, whereas the “human”’ psycholo- 
gists tend to be more congenial and 
accepting in their comments. 

Briefly, the six participants and 
their main positions are as follows: 
]. S. Brown urges that drives do not 
have response directing properties, 
but are mainly energizing or activat- 
ing in function; H. F. Harlow insists 
that homeostasis and internal stimuli 
have been overemphasized as motiva- 
tional sources of learning while im- 
portant and basic external drives (like 
curiosity, manipulation, and explora- 
tion) have been neglected; L. J. 
Postman concludes that motivational 
elements in perceptual behavior can 
be understood in terms of the general 
principles of associative learning such 
as frequency, recency, and effect; V. 
Nowlis describes research (mainly 





BOOK REVIEWS 95 


that in which he participated while 
at the Iowa Child Welfare Research 
Station and later at the University 
of Rochester) on motivational sys- 
tems in children and adults; T. M. 
Newcomb feels that motivation stud- 
ied in a social psychology setting 
should be anchored in the external 
constellation of communicative acts; 
and finally O. H. Mowrer reaffirms 
his position that faulty superego de- 
velopment is nuclear to the formation 
of neurosis and all but recants his 
“‘two-factor’’ theory of learning. 

The most disappointing presenta- 
tion is given by Mowrer. He not only 
moves too rapidly through the topics 
of learning, neurosis, therapy, reli- 
gion, and law, but also refers to his 
previous publications excessively, a 
practice which is particularly dis- 
concerting since the bibliography of 
his paper was omitted. (This error 
has been corrected in later printings 
of the book.) 

This volume should prove valuable 
reading to all psychologists and stu- 
dents interested in problems of moti- 
vation. 

BERNARD SAPER 

Northwestern University 


McCLELLAND, D. C., ATKINSON, 
J. W., Crark, R. A., & LOWELL, 
E. L. The achievement molive. New 
York: Appleton-Century-Crofts, 
1953. Pp. xxii+384. $6.00. 


This book is primarily a progress 
report on a program of research 
which thus far has shown very prom- 


ising results. The authors have 
worked out a scheme for measuring 
the achievement motive under stand- 
ardized conditions, thus making 
available for controlled study one of 
the most important human strivings. 
They have guided their plough into 
rich new ground in the field of moti- 
vation, and this should help compen- 


sate for diminishing yields from stud- 
ies of animal drive. It seems a sig- 
nificant omen that already the au- 
thors are dissatisfied with existing 
theories of motivation. They devote 
nearly a hundred pages to hammering 
out a new theory more adequate to 
their findings concerning a distinc- 
tively human motive. 

The device for measuring the 
achievement motive is derived from 
the Thematic Apperception Test: 
subjects are asked to write stories in 
response to four pictures. The re- 
sponses, however, are considerably 
restricted; they must be written in 
four minutes and entered under four 
printed questions. The experimenter 
keeps announcing the time, and the 
atmosphere is much like that of a 
school examination. These circum- 
stances, scarcely conducive to free 
fantasy, prove admirably contrived 
to produce a measure of achievement 
motivation. The measure comes 
from the amount of achievement 
imagery found in the four stories, 
imagery concerned with “success in 
competition with some standard of 
excellence.’’ The scoring scheme has 
been worked out with great care and 
is both efficient and reliable. The 
validity of the measure is attested by 
comparing the results obtained under 
relaxed, almost playful conditions 
with those obtained under achieve- 
ment-oriented conditions, when some 
preceding tests were offered as serious 
measures of intelligence and capacity 
for leadership. Under the latter con- 
ditions the stories were significantly 
more saturated with achievement 
imagery. This basic experiment was 
repeated with several different 
groups, including a group of Navaho 
boys. It succeeded everywhere with 
male subjects, but it failed with girls. 
When the instructions were modified 
by substituting social acceptability 
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for intelligence and leadership, girls 
also gave the expected increase in 
achievement imagery. 

When individual differences are to 
be studied, the amount of achieve- 
ment imagery shown under achieve- 
ment-oriented conditions is consid- 
ered the best measure. In a variety 

, performance tests high achieve- 

q thBibjects gave evidence of 
greater involvement and greater suc- 
cess. They also gave evidence, 
through questionnaires, that achieve- 
ment motives develop “in families 
where there is an emphasis on the in- 
dependent development of the indi- 
vidual.”’ This finding was fortified by 
an investigation by Winterbottom, 
who tested a group of young boys and 
also interviewed their mothers. She 
found that the mothers of “highs” 
expected their children ‘‘to have met 
independence demands much earlier 
in life’ than the mothers of “lows.” 
All these evidences combine to show 
that the authors’ achievement index 
measures something that is impor- 
tant in the subjects’ behavior and that 
has intelligible roots in their up- 
bringing. 

The only sour note in this sym- 
phony of significant results comes 
when over-all judgments of achieve- 
ment motivation are used.  Self- 
estimates of achievement striving 
failed to correlate with the index; 
worse still, individual study by a 
psychiatrist and clinical psychologist 
failed to agree with the results of the 
standard test. The authors are “in- 
clined to interpret these results as 
meaning that people’s perception of 
achievement motivation and achieve- 
ment motivation itself are two dif- 
ferent things.’’ Does this not imply 
that the achievement index is some- 
how a more true measure of achieve- 
ment motivation, whereas the clini- 
cal judgment is only a person's falli- 
ble perception of it? The index, too, 


is a fallible perception, based on a 
tiny sampling of thoughts by a 
method not equally congenial to each 
person tested. The authors recognize 
that the index can be an imperfect 
measure, especially in those cases 
where anxiety tends to inhibit 
achievement imagery; but their par- 
donable pride in their successful child 
should not trap them into an assur- 
ance that it is the true measure, or 
even the best measure. To say that 
we measure the achievement motive 
is metaphorical, not literal. Our 
measures give us but hints of what 
lies hidden in the person being tested. 
It is enough glory for this book to 
have shown a significant relation be- 
tween this hint and a variety of other 
hints; we can be confident that the 
index has something to do with 
achievement motivation. 

In their theory of motivation the 
authors are strongly influenced by 
Hebb: “our own theory in many re- 
spects picks up where he leaves off.” 
Their system, moreover, follows 
Young’s lead in basing motives on af- 
fective arousal. All motives are 
learned; what is unlearned is affect, 
which occurs whenever stimuli or 
situations produce a_ significant 
change in the organism's adaptation 
level. Changes in adaptation level, 
with their accompanying affect, be- 
come the basis for learning motives; 
“‘a motive is the redintegration by a 
cue of a change in an affective situa- 
tion.”” These few sentences must suf- 
fice here to indicate the general na- 
ture of the theory, which though still 
tentative is worked out in considera- 
ble detail with abundant reference to 
current research. To me it seems an 
important broadening of the concept 
of motivation, similar to Gardner 
Murphy’s view that “all tissues are 
motive centers,"’ and I rejoice that 
this theory, too, takes us out of the 
conceptual dreamworld of visceral 
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drives into a waking theory of human 
motivation. 


RosBert W. WHITE 
Harvard University 


ANASTASI, ANNE. Psychological test- 
ing. New York: Macmillan, 1954. 
Pp. xiii+682. $6.75. 


This book is naturally to be com- 
pared within the frame of reference 
provided by similar works written by 
Super, Cronbach, Goodenough, and 
Greene. In this context, Anastasi’s 
book will definitely hold its own. It 
is scholarly, thoughtful, and thor- 
ough. Its coverage is broad, yet there 
has necessarily been selection. The 
selection has been generally good and 
the emphases well placed. 

As an example of textbook organi- 
zation and writing, this volume may 
well serve as a model. In spite of 
good writing, however, the reading 
will not be found easy except by the 
better students. Many well-selected 
illustrations are incorporated and 
effectively used. Documentation is 
quite adequate. 

The author seems to be right at 
home when discussing such disparate 
subjects as factor analysis and pro- 
jective techniques; measures of ar- 
tistic aptitude and situational tests; 
infant tests and measures of deterior- 
ation. Her evaluations are forthright 
and unafraid and rarely overlook sig- 
nificant points that can be made. I 
doubt that anyone can properly 
charge her with being unfair. She 
brings into high relief the great 
amount of floundering that has oc- 
curred in connection with attempts to 
measure deterioration, with projec- 
tive techniques, and with situational 
tests. 

On a continuum that extends be- 
tween extreme rational versus ex- 
treme empirical approaches to test 
development, she is somewhat on the 


empirical side. This is shown in her 
ubiquitous demands ior data on prac- 
tical validity. She does recognize the 
roles of content validity and factorial 
validity, but they are of secondary 
importance. 

Like most authors who write about 
tests, she thinks of them entirely as 
measures of individual differences. 
It is time that we broaden this con- 
ception and recognize that ‘‘occa- 
sional differences” are also measured 
by means of tests and that the experi- 
mental psychologist is perpetually 
using tests for this purpose. 

In only one important place does 
the author seem uncritical: when she 
accepts the ‘projective line’ that 
projective tests take the “global ap- 
proach.” “Global’’ is a good-sound- 
ing word, but few writers clearly de- 
fine it or think through the implica- 
tions of it. While the objective of a 
“global approach” may be to obtain 
a greater coverage of an individual's 
personality and to view  intra- 
individual relationships, a necessary 
step is to describe him in terms of 
trait names. The same objective can 
also be achieved through other than 
projective tests. 

It is difficult to find flaws in this 
book, technical or otherwise. One or 
two may be suggested, though some 
are admittedly debatable. 

The heading ‘General Classifica- 
tion Tests’ of Part II will be mis- 
leading to some readers. As used, it 
refers to classification in educational 
and social groups, not classification in 
job assignments. 

In discussing the factors in the 
Stanford-Binet tests, the author, for 
some reason, bases her conclusions on 
the abortive analyses of McNemar 
rather than on the more complete 
analyses of Jones. Even the Jones 
analyses probably fail to do justice to 
the number of factors represented— 
through no fault of his. It would re- 
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quire additional reference tests to do 
the job right. 

A debatable point, which Anastasi 
recognizes, is the false dichotomy 
that has persisted between aptitude 
tests and personality tests. Surely an 
individual's abilities are a part of his 
personality. Perhaps we need a con- 
test and a prize for a good term em- 
bracing the nonaptitude aspects of 
personality. 

J. P. GutILrorp 

University of Southern California 


Fink, HAROLD K. Mind and per- 
formance: a comparative study of 
learning in mammals, birds, and 
reptiles. New York: Vantage 
Press, 1954. Pp. xi+113. $3.00. 


One of the objectives of compara- 
tive psychology, as its name implies, 
is fo compare the sensory and motor 
abilities of animals to those of man. 
Yet surprisingly few investigators 
have devoted themselves seriously to 
this problem. The present mono- 


graph describes a five-year research 
which examines the learning ability 
of seven species of mammals includ- 
ing man, chickens, and turtles and 


tortoises. In terms of the number of 
different species studied it is the most 
comprehensive of any of the truly 
comparative investigations of animal 
behavior. 

Learning efficiency is measured 
with a device called the ‘Arrow 
Maze,’ which is a kind of cross be- 
tween the more orthodox type of 
maze and a four-way multiple-choice 
box. Controls or allowances are made 
for variations in motivation and for 
structural differences between the 
organisms. The results are given in 
a series of quotients, analogous to the 
1Q, in which the learning of the infra- 
humans is expressed as a percentage 
of the learning efficiency of man. 
Thus the pig learns the Arrow Maze 
50 per cent as fast as man, while the 


learning ability of the turtle is only 
about 15 per cent that of man. The 
ranking of the various organisms by 
this procedure is man, the domestic 
pig, dog, goat, white rat, chicken, 
rabbit, cat, turtles and _ tortoises 
Sex differences show up consistently 
in favor of the male wherever experi- 
mental conditions permit a compari- 
son—namely in the case of man, the 
pig, the dog, the cat, and the chick. 

The book is well and simply writ- 
ten in language which can be under- 
stood by the intelligent layman and 
is at the same time acceptable to the 
technically trained psychologist. It 
is a comprehensive and significant 
contribution and will make an im- 
portant addition to the literature of 
the field. 

W. N. KELLOGG 
Florida State University 


BELLOws, ROGER M., & Estep, M. 
FRANCIS. Employment psychology: 
the interview. New York: Rinehart & 
Company, Inc., 1954. Pp. ix+295. 
$4.25. 


The authors of this volume take 
the position that if the interview is 
delimited to appraising the psycho- 
logical make-up of the applicant dur- 
ing the interview per se, very little 
can be done to improve employee 
selection by interviewing. If, how- 
ever, the interview becomes part of 
a general view of the candidate for 
employment and includes related ma- 
terial such as trade tests, application 
blank data, and other records, there is 
a definite possibility for improving the 
interviewing process asa selection tool. 

Accordingly, the authors have lit- 
tle to say about the interview per se 
aside from a few conventional com- 
ments about “how to interview.” 
Included in this advice are comments 
such as “It is evident that the inter- 
viewer should prepare himself for the 
interview” (p. 90); “It has been 
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known for several decades that the 
form of the question is of considerable 
importance” (p. 92); “It is hardly 
necessary to add that the interview 
should be terminated in such a way 
as to leave a friendly impression on 
the part of the applicant”’ (p. 93). 

The emphasis, however, is on what 
the authors call the “ancillary” proc- 
esses in employment interviewing. 
Included in these processes are: con- 
struction of application blanks, in- 
terpretation of personal data in se- 
lection of employees, and the use of 
trade tests as interview aids. The 
treatment of these topics is adequate. 
In addition, certain elementary sta- 
tistical tools for use with these ancil- 
lary techniques are presented. It is 
possible these may confuse rather 
than enlighten the audience for which 
the book is intended. The method of 
scaling personal data items simply on 
the basis of percentage of population 
falling in various categories is open to 
question especially since no consider- 
ation is given to the possible signifi- 
cance of the difference of these per- 
centages. 

The book is intended for profes- 
sional interviewers and students of 
applied psychology and requires no 
previous training in psychological 
method. The language and concepts 
are clear enough for the intended 
audience; but in reaching for this 
audience the authors have used a 
style of writing which is soporific. 

The professional psychologist will 
find little new about the interview in 
this volume although the excellent 
bibliography wiil be useful. He will 
wonder why the book is not entitled 
“Employment Psychology: Why In- 
terview?”’ since it contains little or no 
evidence that the interview divorced 
from its supporting processes is of any 
value in the selection of personnel. 

WILLIAM McGEHEE 

Fieldcrest Milis, Inc. 


FRUCHTER, BENJAMIN. Jntroduction 
to factor analysis. New York: Van 
Nostrand, 1954. Pp. xii+280. 
$5.00. 


Unlike a number of other books 
dealing with factor analytic tech- 
niques, this one is elementary, eclec- 
tic, and relatively nonemotional. It 
should serve well to help bridge the 
gap between the mathematical nai- 
veté and the matrical sophistication 
demanded by most modern presen- 
tations. Step-by-step factor extrac- 
tion examples are worked out for 
cluster analysis, the diagonal meth- 
od, the centroid method, the multi- 
ple-group method, and the principal- 
axes method. Schemes for rotation 
to orthogonal or oblique axes are de- 
scribed in detail. Exercises and an- 
swers are provided for each method, 
The fact that the book is largely a 
manual of computational mechanics 
is emphasized by a five-page chapter 
on interpretation of factors and the 
relative paucity of material on choice 
of method. This imbalance is, how- 
ever, somewhat remedied by a chap- 
ter on applications of factor analysis 
in the literature which illustrates the 
utility of factor analysis as a research 
tool. Reference is made to Q- and 
P-technique and other variations in 
study design. A final chapter brings 
together a number of general con- 
siderations regarding the use and 
potential status of factor analysis. 
This book is geared to a first course 
but should also be useful for self- 
instruction within the limitation that 
“rotation is the aspect of factor anal- 
ysis, which is the most difficult to 
teach and the most difficult to learn, 
because it requires considerable back- 
ground, insight, and skill.” 

LEONARD S. KOGAN 


Institute of Welfare Research, 
Community Service Society of 
New York 
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Aususe., Davip P. Theory and prob- 
lems of adolescent development. 
New York: Grune & Stratton, 
1954. Pp. xviii+580. $10.00 


This is not just another book on 
the adolescent. Like many other 
books in this field, all the important 
problems and concepts related to 
adolescent behavior are covered. Un- 
like most other books in this area, 
however, it offers a systematic pres- 
entation of psychological knowledge 
woven around a dynamic point of 
view. It is a scholarly book written 
in language that makes for easy and 
interesting reading. 

The author opens his book with an 
organized set of arguments designed 
to convince his readers that there is 
justification for a book on the “Psy- 
chology of Adolescent Development.” 
His major premise states that “ado- 
lescence is treated as a separate 
developmental period not because 
it covers a decade but because it 
spans an interval in which distinctive 
changes occur in a biosocial status of 
the human organism. The adolescent 
years can be grouped together be- 
cause they possess a certain homoge- 
neity of developmental content” (p. 
6). 

The dynamic concept of “ego- 
involvement”’ integrates all of the 17 
chapters of the book. The reader is 
oriented to this point of view in 
Chapter 1 (p. 23) when the author 
writes the following: 

We may expect to find ego-related charac- 
teristics such as these constituting the core of 
personality structure: an individual's concep 
tion of his own importance; his aspirations for 
self-enhancement; the sources from which he 
desires status; the degree of independence 
characterizing his decisions; the notions he has 
about his ability to control his environment; 
the degree of dependence-independence char- 
acterizing his relations with others; his meth- 
ods of assimilating new values; his concept of 
his own capacity for doing things for himself; 
his self-esteem and feelings of security; his 
ability to withstand frustrations; his ability 
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to judge himself realistically; his need for 
pleasure and immediate gratification; his 
sense of moral obligation and responsibility; 
the types of defenses he uses when his security 
or self-esteem is threatened. 


These aspects of the adolescent’s 
personality structure are constantly 
referred to throughout the book. 

Although Dr. Ausubel gives a 
psychoanalytic flavor to his writing, 
this is by no means a Freudian book. 
Such important concepts as the id- 
ego-superego structure are only brief- 
ly mentioned. There is a short dis- 
cussion on the psychoanalytic ap- 
proach to adolescence in Chapter 1. 
Yet the concept of ego-involvement 
is presented in almost every chapter 
in the book. It is this orientation 
that makes Dr. Ausubel’s book a 
welcome addition to the textbooks on 
adolescent development. 

There is also a mental hygiene 
emphasis in this book. Time and 
again the author presents mental 
health concepts and techniques. For 
example, the chapter called The Ado- 
lescent Peer Culture contains a sec- 
tion on “Helping the Socially Re- 
jected Individual,’’ and the chapter 
called Sexual Behavior in Adolescence 
has a discussion on “‘Sex Guidance.’ 
Part Four of the book is devoted en- 
tirely to the mental health of the 
adolescent. 

The author has marshalled his 
psychological knowledge and psy- 
chiatric experience into a well-organ- 
ized book. Part I, “Introduction,” 
deals with the general field of ado- 
lescent behavior and development. 
Part II, called ‘Psychobiological 
Problems,”’ comprises the largest 
part of the book. Part III is called 
“Psychosocial Problems,"’ and Part 
IV is designated “‘Maladjustment and 
Mental Hygiene.”” Chapters 7, 8, 
and 9 in Part Il—Personality Matur- 
ation During Adolescence, Parent- 
Child Relationships in Adolescence, 
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and Moral and Religious Develop- 
ment—would appear to be more 
properly embodied in Part II where 
the author deals with psychosocial 
problems. 

Dr. Ausubel quotes freely from 
other authors and from his own 
writings and research. There are 
extensive references contained in the 
bibliography at the end of each chap- 
ter. These numerous quotations and 
references appear to compensate for 
the dearth of charts, graphs, and 
tables in this book. Although there 
are hundreds of quotations and over 
a thousand references in the bibliog- 
raphy, there are only three figures in 
the entire book. Case histories are 
also conspicuously absent in this 
book. This is somewhat surprising 
since Dr. Ausubel, who is clinically 
trained, writes in the introduction 
that this textbook “‘is also intended 
as a reference work for professional 
workers in child development, clini- 
cal psychology, pediatrics, psychia- 
try, guidance, and education.” 

Although the author states that 
“this book is primarily intended as an 
advanced textbook in adolescent 
psychology for graduate students in 
psychology and education,” it would 
appear that this book is written with 
sufficient clarity and nontechnical 
language to be used in any college 
course dealing with adolescent be- 
havior. It would be especially de- 
sirable in clinically-oriented courses. 
This reviewer believes that this book 
will find a ready market if the price 
will not detract from its sales. 

BARNEY Katz 

Los Angeles, California 


Bross, Irwin D. J. Design for deci- 
ston. New York: Macmillan, 1953. 
Pp. viiit276. $4.25. 


Some Americcn statisticians main- 
tain that in Wald’s decision theory 
modern statistics, as a system of in- 
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ductive logic, has progressed as far 
beyond Fisher as Fisher advanced it 
beyond the Pearsonian era. In this 
book, Bross successfully describes, in 
a nontechnical style, how statistical 
tests and estimation relate to the 
broad modern conceptions of statisti- 
cal decision and game theory. This 
he does with frequent humorous, or 
even facetious asides at targets rang- 
ing from mathematicians, philoso- 
phers, and kings down to the lowly 
sportscaster. His advice, frequently 
offered, is rarely inane, “‘... try to 
cultivate a critical mind...” (p. 3); 
more frequently it is sound and with 
broad philosophical implications, ‘‘it 
is well to remember the following rule 
for working with models: A model is 
neither true nor false.”” (p. 172). To 
all but the sophisticated there will 
be new information as well as new 
ideas; for example the actual fre- 
quency of Type I errors or the impli- 
cations of high frequency of Type I] 
errors. 

The book is singularly free of error, 
because Bross is capable of dealing 
with each item at a far more technical 
level than was required for his pres- 
ent task. I do believe he could have 
let his readers know, in many in- 
stances, that there are effective stan- 
dard mathematical methods for ob- 
taining decision makers. Also, he 
fails to clarify the differences be- 
tween experiments and normative 
studies with all their critical implica- 
tions. Once or twice Bross gets a 
little too enthusiastic, implying that 
decision theory is entirely objective; 
it is, after you have chosen a value 
system and selected permissible risks. 

I highly recommend Design for De- 
cision to all who want a painless in- 
jection of the simple, basic ideas 
which have revolutionized modern 
statistics. 

Davip A. GRANT 

University of Wisconsin 
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